POLYKETIDE SYNTHASE ENZYMES AND RECOMBINANT DNA
CONSTRUCTS THEREFOR 



Field of the Invention

The present invention relates to polyketides and the polyketide synthase (PKS) 
enzymes that produce them. The invention also relates generally to genes encoding PKS 
enzymes and to recombinant host cells containing such genes and in which expression of 
such genes leads to the production of polyketides. The present invention also relates to 
compounds useful as medicaments having immunosuppressive and/or neurotrophic
activity. Thus, the invention relates to the fields of chemistry, molecular biology, and
agricultural, medical, and veterinary technology.

Background of the Invention 

Polyketides are a class of compounds synthesized from 2-carbon units through a
series of condensations and subsequent modifications. Polyketides occur in many types 
of organisms, including fungi and mycelial bacteria, in particular, the actinomycetes. 
Polyketides are biologically active molecules with a wide variety of structures, and the 
class encompasses numerous compounds with diverse activities. Tetracycline,
erythromycin, epothilone, FK-506, FK-520, narbomycin, picromycin, rapamycin,
spinosyn, and tylosin are examples of polyketides. Given the difficulty in producing
polyketide compounds by traditional chemical methodology, and the typically low 
production of polyketides in wild-type cells, there has been considerable interest in 
finding improved or alternate means to produce polyketide compounds. 

This interest has resulted in the cloning, analysis, and manipulation by
recombinant DNA technology of genes that encode PKS enzymes. The resulting 
technology allows one to manipulate a known PKS gene cluster either to produce the 
polyketide synthesized by that PKS at higher levels than occur in nature or in hosts that 
otherwise do not produce the polyketide. The technology also allows one to produce
molecules that are structurally related to, but distinct from, the polyketides produced
from known PKS gene clusters. See, e.g., PCT publication Nos. WO 93/13663;
95/08548; 96/40968; 97/02358; 98/27203; and 98/49315; United States Patent Nos.
4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146; 5,830,750; and
5,843,718; and Fu et al., 1994, Biochemistry 33: 9321-9326; McDaniel et al., 1993,
Science 262: 1546-1550; and Rohr, 1995, Angew. Chem. Int. Ed. Engl. 34(8): 881-888,
each of which is incorporated herein by reference. 

Polyketides are synthesized in nature by PKS enzymes. These enzymes, which
are complexes of multiple large proteins, are similar to the synthases that catalyze 
condensation of 2-carbon units in the biosynthesis of fatty acids. PKSs catalyze the 
biosynthesis of polyketides through repeated, decarboxylative Claisen condensations 
between acylthioester building blocks. The building blocks used to form complex
polyketides are typically acylthioesters, such as acetyl, butyryl, propionyl, malonyl,
hydroxymalonyl, methylmalonyl, and ethylmalonyl CoA. Other building blocks include
amino acid-like acylthioesters. PKS enzymes that incorporate such building blocks
include an activity that functions as an amino acid ligase (an AMP ligase) or as a
non-ribosomal peptide synthetase (NRPS). Two major types of PKS enzymes are known;
these differ in their composition and mode of synthesis of the polyketide synthesized.
These two major types of PKS enzymes are commonly referred to as Type I or
"modular" and Type II or "iterative" PKS enzymes.

In the Type I or modular PKS enzyme group, a set of separate catalytic active 
sites (each active site is termed a "domain", and a set thereof is termed a "module") exists
for each cycle of carbon chain elongation and modification in the polyketide synthesis 
pathway. The typical modular PKS is composed of several large polypeptides, which can 
be segregated from amino to carboxy termini into a loading module, multiple extender 
modules, and a releasing (or thioesterase) domain. The PKS enzyme known as
6-deoxyerythronolide B synthase (DEBS) is a Type I PKS. In DEBS, there is a loading
module, six extender modules, and a thioesterase (TE) domain. The loading module, six 
extender modules, and TE of DEBS are present on three separate proteins (designated 
DEBS-1, DEBS-2, and DEBS-3, with two extender modules per protein). Each of the 
DEBS polypeptides is encoded by a separate open reading frame (ORF) or gene; these 
genes are known as eryAI, eryAII, and eryAIII. See Caffrey et al., 1992, FEBS Letters
304: 205, and U.S. Patent No. 5,824,513, each of which is incorporated herein by 
reference. 

Generally, the loading module is responsible for binding the first building block 
used to synthesize the polyketide and transferring it to the first extender module. The
loading module of DEBS consists of an acyltransferase (AT) domain and an acyl carrier
protein (ACP) domain. Another type of loading module utilizes an inactivated
ketosynthase (KS) domain and AT and ACP domains. This inactivated KS is in some
instances called KS^Q, where the superscript letter is the abbreviation for the amino acid,
glutamine, that is present instead of the active site cysteine required for ketosynthase
activity. In other PKS enzymes, including the FK-506 PKS, the loading module
incorporates an unusual starter unit and is composed of a CoA ligase-like activity
domain. In any event, the loading module recognizes a particular acyl-CoA (usually 
acetyl or propionyl but sometimes butyryl or other acyl-CoA) and transfers it as a thiol 
ester to the ACP of the loading module. 

The AT on each of the extender modules recognizes a particular extender-CoA
(malonyl or alpha-substituted malonyl, i.e., methylmalonyl, ethylmalonyl, and
2-hydroxymalonyl) and transfers it to the ACP of that extender module to form a thioester.
Each extender module is responsible for accepting a compound from a prior module,
binding a building block, attaching the building block to the compound from the prior
module, optionally performing one or more additional functions, and transferring the
resulting compound to the next module.

Each extender module of a modular PKS contains a KS, AT, ACP, and zero, one,
two, or three domains that modify the beta-carbon of the growing polyketide chain. A
typical (non-loading) minimal Type I PKS extender module is exemplified by extender
module three of DEBS, which contains a KS domain, an AT domain, and an ACP
domain. These three domains are sufficient to activate a 2-carbon extender unit and
attach it to the growing polyketide molecule. The next extender module, in turn, is
responsible for attaching the next building block and transferring the growing compound
to the next extender module until synthesis is complete.

Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of the
loading module is transferred to form a thiol ester (trans-esterification) at the KS of the
first extender module; at this stage, extender module one possesses an acyl-KS and a
malonyl (or substituted malonyl) ACP. The acyl group derived from the loading module
is then covalently attached to the alpha-carbon of the malonyl group to form a
carbon-carbon bond, driven by concomitant decarboxylation, and generating a new acyl-ACP
that has a backbone two carbons longer than the loading building block (elongation or 
extension). 

The polyketide chain, growing by two carbons at each extender module, is
sequentially passed as covalently bound thiol esters from extender module to extender
module, in an assembly line-like process. The carbon chain produced by this process
alone would possess a ketone at every other carbon atom, producing a polyketone, from 
which the name polyketide arises. Most commonly, however, additional enzymatic 
activities modify the beta keto group of each two carbon unit just after it has been added 
to the growing polyketide chain but before it is transferred to the next module. 
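
The two-carbons-per-module bookkeeping described above can be sketched in a few lines of Python. This is only an illustrative model: the function names, the choice to number carbons from the thioester carbonyl (C-1), and the example of a 3-carbon starter with six extender modules are assumptions made for the sketch, and side chains contributed by substituted malonyl extenders are not counted as backbone carbons.

```python
def backbone_carbons(starter_carbons: int, extender_modules: int) -> int:
    """Backbone length when every extender module adds one 2-carbon unit."""
    if starter_carbons < 1 or extender_modules < 0:
        raise ValueError("starter_carbons must be >= 1 and extender_modules >= 0")
    return starter_carbons + 2 * extender_modules


def unreduced_keto_positions(extender_modules: int) -> list[int]:
    """Carbons that would carry a ketone if no beta-keto processing occurred.

    Assumed numbering: C-1 is the thioester carbonyl of the finished chain,
    so the ketones fall on every other carbon starting at C-3.
    """
    if extender_modules < 0:
        raise ValueError("extender_modules must be >= 0")
    return list(range(3, 2 * extender_modules + 2, 2))


if __name__ == "__main__":
    # Hypothetical example: 3-carbon starter unit, six extender modules.
    print(backbone_carbons(3, 6))
    print(unreduced_keto_positions(6))
```

In this simplified model the "polyketone" pattern is simply the list of odd-numbered carbons from C-3 up to the carbon that was the starter unit's carbonyl.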

Thus, in addition to the minimal module containing KS, AT, and ACP domains
necessary to form the carbon-carbon bond, and as noted above, other domains that
modify the beta-carbonyl moiety can be present. Thus, modules may contain a
ketoreductase (KR) domain that reduces the keto group to an alcohol. Modules may also
contain a KR domain plus a dehydratase (DH) domain that dehydrates the alcohol to a
double bond. Modules may also contain a KR domain, a DH domain, and an
enoylreductase (ER) domain that converts the double bond product to a saturated single
bond using the beta carbon as a methylene function. An extender module can also
contain other enzymatic activities, such as, for example, a methylase or dimethylase
activity.
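
The relationship between the domains present in a module and the resulting state of the beta-carbon can be summarized as a small lookup. The Python sketch below is a simplification under stated assumptions: the names `BetaCarbon` and `beta_carbon_state` are hypothetical, every listed domain is assumed to be catalytically active, and a DH or ER domain is treated as having no effect unless the preceding reduction step (KR, then DH) is also present.

```python
from enum import Enum


class BetaCarbon(Enum):
    KETONE = "ketone"            # no beta-keto processing
    ALCOHOL = "alcohol"          # KR only
    DOUBLE_BOND = "double bond"  # KR + DH
    METHYLENE = "methylene"      # KR + DH + ER


# Domains required to form the carbon-carbon bond (the minimal module).
MINIMAL_DOMAINS = frozenset({"KS", "AT", "ACP"})


def beta_carbon_state(domains) -> BetaCarbon:
    """Predict the beta-carbon state from the set of active domains in a module."""
    present = {d.upper() for d in domains}
    missing = MINIMAL_DOMAINS - present
    if missing:
        raise ValueError(f"not an extender module, missing: {sorted(missing)}")
    if "KR" not in present:
        return BetaCarbon.KETONE
    if "DH" not in present:
        return BetaCarbon.ALCOHOL
    if "ER" not in present:
        return BetaCarbon.DOUBLE_BOND
    return BetaCarbon.METHYLENE


if __name__ == "__main__":
    for module in (["KS", "AT", "ACP"],
                   ["KS", "AT", "KR", "ACP"],
                   ["KS", "AT", "DH", "KR", "ACP"],
                   ["KS", "AT", "DH", "ER", "KR", "ACP"]):
        print(module, "->", beta_carbon_state(module).value)
```

A module whose DH domain is present but non-functional would need to be passed in without "DH" for this sketch to give the expected answer.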

After traversing the final extender module, the polyketide encounters a releasing 
domain that cleaves the polyketide from the PKS and typically cyclizes the polyketide.
For example, final synthesis of 6-dEB is regulated by a TE domain located at the end of
extender module six. In the synthesis of 6-dEB, the TE domain catalyzes cyclization of
the macrolide ring by formation of an ester linkage. In FK-506, FK-520, rapamycin, and
similar polyketides, the TE activity is replaced by a RapP (for rapamycin) or RapP-like
activity that makes a linkage incorporating a pipecolic acid residue. The enzymatic
activity that catalyzes this incorporation for the rapamycin enzyme is known as RapP,
encoded by the rapP gene. The polyketide can be modified further by tailoring enzymes;
these enzymes add carbohydrate groups or methyl groups, or make other modifications,
e.g., oxidation or reduction, on the polyketide core molecule. For example, 6-dEB is
hydroxylated at C-6 and C-12 and glycosylated at C-3 and C-5 in the synthesis of
erythromycin A. 

In Type I PKS polypeptides, the order of catalytic domains is conserved. When 
all beta-keto processing domains are present in a module, the order of domains in that
module from N-to-C-terminus is always KS, AT, DH, ER, KR, and ACP. Some or all of
the beta-keto processing domains may be missing in particular modules, but the order of
the domains present in a module remains the same. The order of domains within modules
is believed to be important for proper folding of the PKS polypeptides into an active
complex. Importantly, there is considerable flexibility in PKS enzymes, which allows for
the genetic engineering of novel catalytic complexes. The engineering of these enzymes
is achieved by modifying, adding, or deleting domains, or replacing them with those
taken from other Type I PKS enzymes. It is also achieved by deleting, replacing, or
adding entire modules with those taken from other sources. A genetically engineered
PKS complex should of course have the ability to catalyze the synthesis of the product
predicted from the genetic alterations made. 
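
When designing an engineered module on paper, the conserved N-to-C domain order stated above (KS, AT, DH, ER, KR, ACP, with any subset of the beta-keto processing domains omitted) can be checked mechanically. The following Python sketch is illustrative only; `CANONICAL_ORDER` and `is_canonical_order` are hypothetical names, and the check covers domain order alone, not linker compatibility or catalytic function.

```python
# Conserved N-to-C-terminal order of domains within a Type I PKS module.
CANONICAL_ORDER = ("KS", "AT", "DH", "ER", "KR", "ACP")


def is_canonical_order(domains) -> bool:
    """Return True if the listed domains appear in the conserved order.

    Domains may be omitted, but unknown names, repeats, and out-of-order
    domains all make the check fail.
    """
    indices = []
    for name in domains:
        if name not in CANONICAL_ORDER:
            return False
        indices.append(CANONICAL_ORDER.index(name))
    # Strictly increasing indices means correct order with no duplicates.
    return all(a < b for a, b in zip(indices, indices[1:]))


if __name__ == "__main__":
    print(is_canonical_order(["KS", "AT", "KR", "ACP"]))        # KR-only module
    print(is_canonical_order(["KS", "AT", "KR", "DH", "ACP"]))  # KR placed before DH
```

A design that fails this check is not necessarily inactive, but it departs from the arrangement the text describes as conserved.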

Alignments of the many available amino acid sequences for Type I PKS enzymes
have approximately defined the boundaries of the various catalytic domains. Sequence
alignments also have revealed linker regions between the catalytic domains and at the N- 
and C-termini of individual polypeptides. The sequences of these linker regions are less 
well conserved than are those for the catalytic domains, which is in part how linker 
regions are identified. Linker regions can be important for proper association between 
domains and between the individual polypeptides that comprise the PKS complex. One 
can thus view the linkers and domains together as creating a scaffold on which the 
domains and modules are positioned in the correct orientation to be active. This 
organization and positioning, if retained, permits PKS domains of different or identical 
substrate specificities to be substituted (usually at the DNA level) between PKS enzymes 
by various available methodologies. In selecting the boundaries of, for example, an AT 
replacement, one can thus make the replacement so as to retain the linkers of the 
recipient PKS or to replace them with the linkers of the donor PKS AT domain, or, 
preferably, make both constructs to ensure that the correct linker regions between the KS 
and AT domains have been included in at least one of the engineered enzymes. Thus, 
there is considerable flexibility in the design of new PKS enzymes with the result that 
known polyketides can be produced more effectively, and novel polyketides useful as 
pharmaceuticals or for other purposes can be made. 

By appropriate application of recombinant DNA technology, a wide variety of 
polyketides can be prepared in a variety of different host cells provided one has access to 
nucleic acid compounds that encode PKS proteins and polyketide modification enzymes. 
The present invention helps meet the need for such nucleic acid compounds by providing 
recombinant vectors that encode the FK-520 PKS enzyme and various FK-520 
modification enzymes. Moreover, while the FK-506 and FK-520 polyketides have many
useful activities, there remains a need for compounds with similar useful activities but
with a better pharmacokinetic profile and metabolism and fewer side-effects. The present
invention helps meet the need for such compounds as well. 

Summary of the Invention 
In one embodiment, the present invention provides recombinant DNA vectors
that encode all or part of the FK-520 PKS enzyme. Illustrative vectors of the invention 
include cosmid pKOS034-120, pKOS034-124, pKOS065-C31, pKOS065-C3, 
pKOS065-M27, and pKOS065-M21. The invention also provides nucleic acid
compounds that encode the various domains of the FK-520 PKS, i.e., the KS, AT, ACP,
KR, DH, and ER domains. These compounds can be readily used, alone or in
combination with nucleic acids encoding other FK-520 or non-FK-520 PKS domains, as
intermediates in the construction of recombinant vectors that encode all or part of PKS
enzymes that make novel polyketides. 

The invention also provides isolated nucleic acids that encode all or part of one or 
more modules of the FK-520 PKS, each module comprising a ketosynthase activity, an
acyl transferase activity, and an acyl carrier protein activity. The invention provides an
isolated nucleic acid that encodes one or more open reading frames of FK-520 PKS
genes, said open reading frames comprising coding sequences for a CoA ligase activity,
an NRPS activity, or two or more extender modules. The invention also provides 
recombinant expression vectors containing these nucleic acids. 

In another embodiment, the invention provides isolated nucleic acids that encode
all or a part of a PKS that contains at least one module in which at least one of the
domains in the module is a domain from a non-FK-520 PKS and at least one domain is
from the FK-520 PKS. The non-FK-520 PKS domain or module originates from the
rapamycin PKS, the FK-506 PKS, DEBS, or another PKS. The invention also provides
recombinant expression vectors containing these nucleic acids.

In another embodiment, the invention provides a method of preparing a
polyketide, said method comprising transforming a host cell with a recombinant DNA
vector that encodes at least one module of a PKS, said module comprising at least one
FK-520 PKS domain, and culturing said host cell under conditions such that said PKS is
produced and catalyzes synthesis of said polyketide. In one aspect, the method is
practiced with a Streptomyces host cell. In another aspect, the polyketide produced is
FK-520. In another aspect, the polyketide produced is a polyketide related in structure to
FK-520. In another aspect, the polyketide produced is a polyketide related in structure to 
FK-506 or rapamycin. 

In another embodiment, the invention provides a set of genes in recombinant
form sufficient for the synthesis of ethylmalonyl CoA in a heterologous host cell. These
genes and the methods of the invention enable one to create recombinant host cells with 
the ability to produce polyketides or other compounds that require ethylmalonyl CoA for 
biosynthesis. The invention also provides recombinant nucleic acids that encode AT 
domains specific for ethylmalonyl CoA. Thus, the compounds of the invention can be 
used to produce polyketides requiring ethylmalonyl CoA in host cells that otherwise are
unable to produce such polyketides.

In another embodiment, the invention provides a set of genes in recombinant 
form sufficient for the synthesis of 2-hydroxymalonyl CoA and 2-methoxymalonyl CoA
in a heterologous host cell. These genes and the methods of the invention enable one to
create recombinant host cells with the ability to produce polyketides or other compounds
that require 2-hydroxymalonyl CoA for biosynthesis. The invention also provides
recombinant nucleic acids that encode AT domains specific for 2-hydroxymalonyl CoA
and 2-methoxymalonyl CoA. Thus, the compounds of the invention can be used to
produce polyketides requiring 2-hydroxymalonyl CoA or 2-methoxymalonyl CoA in
host cells that are otherwise unable to produce such polyketides.

In another embodiment, the invention provides a compound related in structure to 
FK-520 or FK-506 that is useful in the treatment of a medical condition. These 
compounds include compounds in which the C-13 methoxy group is replaced by a 
moiety selected from the group consisting of hydrogen, methyl, and ethyl moieties. Such 
compounds are less susceptible to the main in vivo pathway of degradation for FK-520 
and FK-506 and related compounds and thus exhibit an improved pharmacokinetic 
profile. The compounds of the invention also include compounds in which the C-15 
methoxy group is replaced by a moiety selected from the group consisting of hydrogen, 
methyl, and ethyl moieties. The compounds of the invention also include the above 
compounds further modified by chemical methodology to produce derivatives such as, 
but not limited to, the C-18 hydroxyl derivatives, which have potent neurotrophin but not
immunosuppression activities.

Thus, the invention provides polyketides having the structure:

[Chemical structure not reproduced in this text.]

wherein R1 is hydrogen, methyl, ethyl, or allyl; R2 is hydrogen or hydroxyl, provided
that when R2 is hydrogen, there is a double bond between C-20 and C-19; R3 is hydrogen
or hydroxyl; R4 is methoxyl, hydrogen, methyl, or ethyl; and R5 is methoxyl, hydrogen,
methyl, or ethyl; but not including FK-506, FK-520, 18-hydroxy-FK-520, and
18-hydroxy-FK-506. The invention provides these compounds in purified form and in
pharmaceutical compositions. 

In another embodiment, the invention provides a method for treating a medical
condition by administering a pharmaceutically efficacious dose of a compound of the
invention. The compounds of the invention may be administered to achieve
immunosuppression or to stimulate nerve growth and regeneration.

These and other embodiments and aspects of the invention will be more fully
understood after consideration of the attached Drawings and their brief description
below, together with the detailed description, examples, and claims that follow. 



Brief Description of the Drawings

Figure 1 shows a diagram of the FK-520 biosynthetic gene cluster. The top line
provides a scale in kilobase pairs (kb). The second line shows a restriction map with
selected restriction enzyme recognition sequences indicated. K is KpnI; X is XhoI; S is
SacI; P is PstI; and E is EcoRI. The third line indicates the position of FK-520 PKS and
related genes. Genes are abbreviated with a one letter designation, i.e., C is fkbC.
Immediately under the third line are numbered segments showing where the loading
module (L) and ten different extender modules (numbered 1-10) are encoded on the
various genes shown. At the bottom of the Figure, the DNA inserts of various cosmids of
the invention (i.e., 34-124 is cosmid pKOS034-124) are shown in alignment with the
FK-520 biosynthetic gene cluster. 

Figure 2 shows the loading module (load), the ten extender modules, and the
peptide synthetase domain of the FK-520 PKS, together with, on the top line, the genes
that encode the various domains and modules. Also shown are the various intermediates
in FK-520 biosynthesis, as well as the structure of FK-520, with carbons 13, 15, 21, and
31 numbered. The various domains of each module and subdomains of the loading
module are also shown. The darkened circles showing the DH domains in modules 2, 3,
and 4 indicate that the dehydratase domain is not functional as a dehydratase; this
domain may affect the stereochemistry at the corresponding position in the polyketide.
The substituents on the FK-520 structure that result from the action of non-PKS enzymes
are also indicated by arrows, together with the types of enzymes or the genes that code
for the enzymes that mediate the action. Although the methyltransferase is shown acting
at the C-13 and C-15 hydroxyl groups after release of the polyketide from the PKS, the
methyltransferase may act on the 2-hydroxymalonyl substrate prior to or
contemporaneously with its incorporation during polyketide synthesis. 

Figure 3 shows a close-up view of the left end of the FK-520 gene cluster, which 
contains at least ten additional genes. The ethyl side chain on carbon 21 of FK-520 
(Figure 2) is derived from an ethylmalonyl CoA extender unit that is incorporated by an
ethylmalonyl specific AT domain in extender module 4 of the PKS. At least four of the
genes in this region code for enzymes involved in ethylmalonyl biosynthesis. The
polyhydroxybutyrate depolymerase is involved in maintaining hydroxybutyryl-CoA
pools during FK-520 production. Polyhydroxybutyrate accumulates during vegetative
growth and disappears during stationary phase in other Streptomyces (Ranade and
Vining, 1993, Can. J. Microbiol. 39:371). Open reading frames with unknown function
are indicated with a question mark. 

Figure 4 shows a biosynthetic pathway for the biosynthesis of ethylmalonyl CoA 
from acetoacetyl CoA consistent with the function assigned to four of the genes in the 
FK-520 gene cluster shown in Figure 3. 

Figure 5 shows a close-up view of the right-end of the FK-520 PKS gene cluster 
(and of the sequences on cosmid pKOS065-C31). The genes shown include fkbD, fkbM
(a methyl transferase that methylates the hydroxyl group on C-31 of FK-520), fkbN (a 
homolog of a gene described as a regulator of cholesterol oxidase and that is believed to 
be a transcriptional activator), fkbQ (a type II thioesterase, which can increase polyketide 
production levels), and fkbS (a crotonyl-CoA reductase involved in the biosynthesis of 
ethylmalonyl CoA). 

Figure 6 shows the proposed degradative pathway for tacrolimus (FK-506)
metabolism. 

Figure 7 shows a schematic process for the construction of recombinant PKS 
genes of the invention that encode PKS enzymes that produce 13-desmethoxy FK-506 
and FK-520 polyketides of the invention, as described in Example 4, below.

Figure 8, in Parts A and B, shows certain compounds of the invention preferred 
for dermal application in Part A and a synthetic route for making those compounds in 
Part B.

Detailed Description of the Invention 
Given the valuable pharmaceutical properties of polyketides, there is a need for 
methods and reagents for producing large quantities of polyketides, as well as for 
producing related compounds not found in nature. The present invention provides such 
methods and reagents, with particular application to methods and reagents for producing
the polyketides known as FK-520, also known as ascomycin or L-683,590 (see Holt et
al., 1993, JACS 115:9925), and FK-506, also known as tacrolimus. Tacrolimus is a
macrolide immunosuppressant used to prevent or treat rejection of transplanted heart,
kidney, liver, lung, pancreas, and small bowel allografts. The drug is also useful for the
prevention and treatment of graft-versus-host disease in patients receiving bone marrow
transplants, and for the treatment of severe, refractory uveitis. There have been additional
reports of the unapproved use of tacrolimus for other conditions, including alopecia
universalis, autoimmune chronic active hepatitis, inflammatory bowel disease, multiple
sclerosis, primary biliary cirrhosis, and scleroderma. The invention provides methods
and reagents for making novel polyketides related in structure to FK-520 and FK-506, 
and structurally related polyketides such as rapamycin. 

The FK-506 and rapamycin polyketides are potent immunosuppressants, with
chemical structures shown below.

[Chemical structures not reproduced: FK-506 and Rapamycin.]

FK-520 differs from FK-506 in that it lacks the allyl group at C-21 of FK-506, having
instead an ethyl group at that position, and has similar activity to FK-506, albeit reduced
immunosuppressive activity. 

These compounds act through initial formation of an intermediate complex with
protein "immunophilins" known as FKBPs (FK-506 binding proteins), including FKBP-12.
Immunophilins are a class of cytosolic proteins that form complexes with molecules
such as FK-506, FK-520, and rapamycin that in turn serve as ligands for other cellular
targets involved in signal transduction. Binding of FK-506, FK-520, and rapamycin to
FKBP occurs through the structurally similar segments of the polyketide molecules,
known as the "FKBP-binding domain" (as generally but not precisely indicated by the
stippled regions in the structures above). The FK-506-FKBP complex then binds
calcineurin, while the rapamycin-FKBP complex binds to a protein known as RAFT-1. 
Binding of the FKBP-polyketide complex to these second proteins occurs through the 
dissimilar regions of the drugs known as the "effector" domains. 




[Scheme not reproduced: Immunosuppression.]

The three component FKBP-polyketide-effector complex is required for signal 
transduction and subsequent immunosuppressive activity of FK-506, FK-520, and 
rapamycin. Modifications in the effector domains of FK-506, FK-520, and rapamycin 
that destroy binding to the effector proteins (calcineurin or RAFT) lead to loss of
immunosuppressive activity, even though FKBP binding is unaffected. Further, such
analogs antagonize the immunosuppressive effects of the parent polyketides, because
they compete for FKBP. Such non-immunosuppressive analogs also show reduced
toxicity (see Dumont et al., 1992, Journal of Experimental Medicine 176, 751-760),
indicating that much of the toxicity of these drugs is not linked to FKBP binding.

In addition to immunosuppressive activity, FK-520, FK-506, and rapamycin have
neurotrophic activity. In the central nervous system and in peripheral nerves,
immunophilins are referred to as "neuroimmunophilins". The neuroimmunophilin FKBP
is markedly enriched in the central nervous system and in peripheral nerves. Molecules
that bind to the neuroimmunophilin FKBP, such as FK-506 and FK-520, have the
remarkable effect of stimulating nerve growth. In vitro, they act as neurotrophins, i.e.,
they promote neurite outgrowth in NGF-treated PC12 cells and in sensory neuronal
cultures, and in intact animals, they promote regrowth of damaged facial and sciatic
nerves, and repair lesioned serotonin and dopamine neurons in the brain. See Gold et al.,
Jun. 1999, J. Pharm. Exp. Ther. 289(3): 1202-1210; Lyons et al., 1994, Proc. National
Academy of Science 91: 3191-3195; Gold et al., 1995, Journal of Neuroscience 15:
7509-7516; and Steiner et al., 1997, Proc. National Academy of Science 94: 2019-2024.
Further, the restored central and peripheral neurons appear to be functional. 

Compared to protein neurotrophic molecules (BDNF, NGF, etc.), the small-molecule
neurotrophins such as FK-506, FK-520, and rapamycin have different, and
often advantageous, properties. First, whereas protein neurotrophins are difficult to
deliver to their intended site of action and may require intra-cranial injection, the
small-molecule neurotrophins display excellent bioavailability; they are active when
administered subcutaneously and orally. Second, whereas protein neurotrophins show
quite specific effects, the small-molecule neurotrophins show rather broad effects.
Finally, whereas protein neurotrophins often show effects on normal sensory nerves, the
small-molecule neurotrophins do not induce aberrant sprouting of normal neuronal
processes and seem to affect damaged nerves specifically. Neuroimmunophilin ligands
have potential therapeutic utility in a variety of disorders involving nerve degeneration
(e.g., multiple sclerosis, Parkinson's disease, Alzheimer's disease, stroke, traumatic
spinal cord and brain injury, peripheral neuropathies).

Recent studies have shown that the immunosuppressive and neurite outgrowth 
activity of FK-506, FK-520, and rapamycin can be separated; the neuroregenerative 
activity in the absence of immunosuppressive activity is retained by agents which bind to 
FKBP but not to the effector proteins calcineurin or RAFT. See Steiner et al., 1997,
Nature Medicine 3: 421-428.



Available structure-activity data show that the important features for neurotrophic 
activity of rapamycin, FK-520, and FK-506 lie within the common, contiguous segments 
of the macrolide ring that bind to FKBP. This portion of the molecule is termed the
"FKBP binding domain" (see Van Duyne et al., 1993, Journal of Molecular Biology 229:
105-124.). Nevertheless, the effector domains of the parent macrolides contribute to 
conformational rigidity of the binding domain and thus indirectly contribute to FKBP 
binding. 




[Scheme not reproduced: Nerve Regeneration; "FKBP binding domain".]

There are a number of other reported analogs of FK-506, FK-520, and rapamycin that
bind to FKBP but not the effector protein calcineurin or RAFT. These analogs show
effects on nerve regeneration without immunosuppressive effects.

Naturally occurring FK-520 and FK-506 analogs include the antascomycins,
which are FK-506-like macrolides that lack the functional groups of FK-506 that bind to
calcineurin (see Fehr et al., 1996, The Journal of Antibiotics 49: 230-233). These
molecules bind FKBP as effectively as does FK-506; they antagonize the effects of both 
FK-506 and rapamycin, yet lack immunosuppressive activity. 




[Structure not reproduced: Antascomycin A.]

Other analogs can be produced by chemically modifying FK-506, FK-520, or 
rapamycin. One approach to obtaining neuroimmunophilin ligands is to destroy the 
effector binding region of FK-506, FK-520, or rapamycin by chemical modification. 
While the chemical modifications permitted on the parent compounds are quite limited,
some useful chemically modified analogs exist. The FK-520 analog L-685,818 (ED50 =
0.7 nM for FKBP binding; see Dumont et al., 1992), and the rapamycin analog
WAY-124,466 (IC50 = 12.5 nM; see Ocain et al., 1993, Biochemical and Biophysical Research
Communications 192: 1340-1346) are about as effective as FK-506, FK-520, and
rapamycin at promoting neurite outgrowth in sensory neurons (see Steiner et al., 1997).

[Structures not reproduced: L-685,818 and WAY-124,466.]



One of the few positions of rapamycin that is readily amenable to chemical
modification is the allylic 16-methoxy group; this reactive group is readily exchanged by
acid-catalyzed nucleophilic substitution. Replacement of the 16-methoxy group of
rapamycin with a variety of bulky groups has produced analogs showing selective loss of
immunosuppressive activity while retaining FKBP-binding (see Luengo et al., 1995,
Chemistry & Biology 2: 471-481). One of the best compounds, 1, below, shows complete 
loss of activity in the splenocyte proliferation assay with only a 10-fold reduction in 
binding to FKBP. 




There are also synthetic analogs of FKBP binding domains. These compounds
reflect an approach to obtaining neuroimmunophilin ligands based on "rationally
designed" molecules that retain the FKBP-binding region in an appropriate conformation
for binding to FKBP but do not possess the effector binding regions. In one example, the
ends of the FKBP binding domain were tethered by hydrocarbon chains (see Holt et al.,
1993, Journal of the American Chemical Society 115: 9925-9938); the best analog, 2,
below, binds to FKBP about as well as FK-506. In a similar approach, the ends of the
FKBP binding domain were tethered by a tripeptide to give analog 3, below, which binds
to FKBP about 20-fold less tightly than FK-506. These compounds are anticipated to have
neuroimmunophilin binding activity.




2 3 



In a primate MPTP model of Parkinson's disease, administration of FKBP ligand
GPI-1046 caused brain cells to regenerate and behavioral measures to improve. MPTP is
a neurotoxin, which, when administered to animals, selectively damages nigral-striatal
dopamine neurons in the brain, mimicking the damage caused by Parkinson's disease.
Whereas, before treatment, animals were unable to use affected limbs, the FKBP ligand
restored the ability of animals to feed themselves and gave improvements in measures of
locomotor activity, neurological outcome, and fine motor control. There were also
corresponding increases in regrowth of damaged nerve terminals. These results
demonstrate the utility of FKBP ligands for treatment of diseases of the CNS.

From the above description, two general approaches towards the design of
non-immunosuppressant, neuroimmunophilin ligands can be seen. The first involves the
construction of constrained cyclic analogs of FK-506 in which the FKBP binding domain
is fixed in a conformation optimal for binding to FKBP. The advantages of this approach
are that the conformation of the analogs can be accurately modeled and predicted by
computational methods, and the analogs closely resemble parent molecules that have
proven pharmacological properties. A disadvantage is that the difficult chemistry limits
the numbers and types of compounds that can be prepared. The second approach
involves the trial and error construction of acyclic analogs of the FKBP binding domain
by conventional medicinal chemistry. The advantage of this approach is that the
chemistry is suitable for production of the numerous compounds needed for such
interactive chemistry-bioassay approaches. The disadvantages are that the molecular
types of compounds that have emerged have no known history of appropriate
pharmacological properties, have rather labile ester functional groups, and are too
conformationally mobile to allow accurate prediction of conformational properties.

The present invention provides useful methods and reagents related to the first
approach, but with significant advantages. The invention provides recombinant PKS
genes that produce a wide variety of polyketides that cannot otherwise be readily
synthesized by chemical methodology alone. Moreover, the present invention provides
polyketides that have either or both of the desired immunosuppressive and neurotrophic
activities, some of which are produced only by fermentation and others of which are
produced by fermentation and chemical modification. Thus, in one aspect, the invention
provides compounds that optimally bind to FKBP but do not bind to the effector
proteins. The methods and reagents of the invention can be used to prepare numerous
constrained cyclic analogs of FK-520 in which the FKBP binding domain is fixed in a
conformation optimal for binding to FKBP. Such compounds will show
neuroimmunophilin binding (neurotrophic) but not immunosuppressive effects. The
invention also allows direct manipulation of FK-520 and related chemical structures via
genetic engineering of the enzymes involved in the biosynthesis of FK-520 (as well as
related compounds, such as FK-506 and rapamycin); similar modifications by chemical
means are simply not possible because of the complexity of the structures. The invention
can also be used to introduce "chemical handles" into normally inert positions that permit
subsequent chemical modifications.

Several general approaches to achieve the development of novel 
neuroimmunophilin ligands are facilitated by the methods and reagents of the present 
invention. One approach is to make "point mutations" of the functional groups of the
parent FK-520 structure that bind to the effector molecules to eliminate their binding 
potential. These types of structural modifications are difficult to perform by chemical 
modification, but can be readily accomplished with the methods and reagents of the 
invention. 

A second, more extensive approach facilitated by the present invention is to
utilize molecular modeling to predict optimal structures ab initio that bind to FKBP but
not effector molecules. Using the available X-ray crystal structure of FK-520 (or
FK-506) bound to FKBP, molecular modeling can be used to predict polyketides that should
optimally bind to FKBP but not calcineurin. Various macrolide structures can be
generated by linking the ends of the FKBP-binding domain with "all possible"
polyketide chains of variable length and substitution patterns that can be prepared by
genetic manipulation of the FK-520 or FK-506 PKS gene cluster in accordance with the
methods of the invention. The ground state conformations of the virtual library can be
determined, and compounds that possess binding domains most likely to bind well to
FKBP can be prepared and tested.
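As an illustration only, the enumeration of a virtual library of linker chains of variable length and substitution pattern can be sketched as follows. The extender-unit labels, the range of chain lengths, and the function names are hypothetical choices made for this sketch; they are not taken from the specification, and no conformational modeling is attempted here.

```python
from itertools import product

# Hypothetical extender-unit choices, used purely for illustration.
EXTENDER_UNITS = ("malonyl", "methylmalonyl", "ethylmalonyl", "methoxymalonyl")


def enumerate_linkers(min_modules, max_modules, units=EXTENDER_UNITS):
    """Yield every ordered combination of extender units for linker chains
    built from min_modules to max_modules extension steps (inclusive)."""
    for n_modules in range(min_modules, max_modules + 1):
        for combo in product(units, repeat=n_modules):
            yield combo


def library_size(min_modules, max_modules, n_units=len(EXTENDER_UNITS)):
    """Closed-form count of the same library: sum of n_units**n over lengths."""
    return sum(n_units ** n for n in range(min_modules, max_modules + 1))


# Assumed range of linker lengths (2 to 4 extension steps).
linkers = list(enumerate_linkers(2, 4))
print(len(linkers), library_size(2, 4))
```

Each tuple would then be handed to whatever modeling step is used to estimate ground state conformations; the size of the library grows geometrically with the number of extension steps, which is why a closed-form count is useful before enumerating.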
Once a compound is identified in accordance with the above approaches, the
invention can be used to generate a focused library of analogs around the lead candidate,
to "fine tune" the compound for optimal properties. Finally, the genetic engineering
methods of the invention can be directed towards producing "chemical handles" that
enable medicinal chemists to modify positions of the molecule previously inert to
chemical modification. This opens the path to previously prohibited chemical
optimization of lead compounds by time-proven approaches.

Moreover, the present invention provides polyketide compounds, and the
recombinant genes for the PKS enzymes that produce the compounds, that have
significant advantages over FK-506 and FK-520 and their analogs. The metabolism and
pharmacokinetics of tacrolimus have been extensively studied, and FK-520 is believed to
be similar in these respects. Absorption of tacrolimus is rapid, variable, and incomplete
from the gastrointestinal tract (Harrison's Principles of Internal Medicine, 14th edition,
1998, McGraw Hill, 14, 20, 21, 64-67). The mean bioavailability of the oral dosage form
is 27% (range 5 to 65%). The volume of distribution (VolD) based on plasma is 5 to 65
L per kg of body weight (L/kg), and is much higher than the VolD based on whole blood
concentrations, the difference reflecting the binding of tacrolimus to red blood cells.
Whole blood concentrations may be 12 to 67 times the plasma concentrations. Protein
binding is high (75 to 99%), primarily to albumin and alpha1-acid glycoprotein. The
half-life for distribution is 0.9 hour; elimination is biphasic and variable, with a
terminal half-life of 11.3 hours (range, 3.5 to 40.5 hours). The time to peak
concentration is 0.5 to 4 hours after oral administration.
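The arithmetic behind these figures can be sketched as follows. This is a minimal illustration that assumes simple first-order (single-exponential) decay during the terminal phase, which is a simplification of the biphasic elimination described above; the 5 mg oral dose and the function names are hypothetical.

```python
import math


def elimination_rate_constant(half_life_h):
    """First-order rate constant (per hour) for a given half-life in hours."""
    return math.log(2) / half_life_h


def fraction_remaining(half_life_h, elapsed_h):
    """Fraction of drug remaining after elapsed_h hours of first-order decay."""
    return 0.5 ** (elapsed_h / half_life_h)


def systemic_dose(oral_dose_mg, bioavailability):
    """Amount of an oral dose expected to reach the systemic circulation."""
    return oral_dose_mg * bioavailability


TERMINAL_HALF_LIFE_H = 11.3      # value quoted in the text
MEAN_BIOAVAILABILITY = 0.27      # value quoted in the text
HYPOTHETICAL_ORAL_DOSE_MG = 5.0  # assumption for illustration only

print(elimination_rate_constant(TERMINAL_HALF_LIFE_H))
print(fraction_remaining(TERMINAL_HALF_LIFE_H, 24.0))
print(systemic_dose(HYPOTHETICAL_ORAL_DOSE_MG, MEAN_BIOAVAILABILITY))
# The quoted 5 to 65% bioavailability range gives a 13-fold spread in exposure.
print(systemic_dose(HYPOTHETICAL_ORAL_DOSE_MG, 0.05),
      systemic_dose(HYPOTHETICAL_ORAL_DOSE_MG, 0.65))
```

The wide spread between the low and high ends of the bioavailability range illustrates why dosing of the parent compound is difficult to predict.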

Tacrolimus is metabolized primarily by cytochrome P450 3A enzymes in the
liver and small intestine. The drug is extensively metabolized, with less than 1% excreted
unchanged in urine. Because hepatic dysfunction decreases clearance of tacrolimus,
doses have to be reduced substantially in primary graft non-function, especially in
children. In addition, drugs that induce the cytochrome P450 3A enzymes reduce
tacrolimus levels, while drugs that inhibit these P450s increase tacrolimus levels.
Tacrolimus bioavailability doubles with co-administration of ketoconazole, a drug that
inhibits P450 3A. See Vincent et al., 1992, In vitro metabolism of FK-506 in rat, rabbit,
and human liver microsomes: Identification of a major metabolite and of cytochrome
P450 3A as the major enzymes responsible for its metabolism, Arch. Biochem. Biophys.
294: 454-460; Iwasaki et al., 1993, Isolation, identification, and biological activities of
oxidative metabolites of FK-506, a potent immunosuppressive macrolide lactone, Drug
Metabolism & Disposition 21: 971-977; Shiraga et al., 1994, Metabolism of FK-506, a
potent immunosuppressive agent, by cytochrome P450 3A enzymes in rat, dog, and
human liver microsomes, Biochem. Pharmacol. 47: 727-735; and Iwasaki et al., 1995,
Further metabolism of FK-506 (Tacrolimus): Identification and biological activities of
the metabolites oxidized at multiple sites of FK-506, Drug Metabolism & Disposition 23:
28-34. The cytochrome P450 3A subfamily of isozymes has been implicated as
important in this degradative process.

Structures of the eight isolated metabolites formed by liver microsomes are
shown in Figure 6. Four metabolites of FK-506 involve demethylation of the oxygens on
carbons 13, 15, and 31, and hydroxylation of carbon 12. The 13-demethylated (hydroxy)
compounds undergo cyclizations of the 13-hydroxy at C-10 to give M-I, M-VI, and M-VII,
and the 12-hydroxy metabolite at C-10 to give I. Another four metabolites, formed by
oxidation of the four metabolites mentioned above, were isolated using liver microsomes
from dexamethasone-treated rats. Three of these are metabolites doubly demethylated at
the methoxy groups on carbons 15 and 31 (M-V), 13 and 31 (M-VI), and 13 and 15
(M-VII). The fourth, M-VIII, was the metabolite produced after demethylation of the
31-methoxy group, followed by formation of a fused ring system by further oxidation.
Among the eight metabolites, M-II has immunosuppressive activity comparable to that
of FK-506, whereas the other metabolites exhibit weak or negligible activities.
Importantly, the major metabolite of human, dog, and rat liver microsomes is the
13-demethylated and cyclized FK-506 (M-I).

Thus, the major metabolism of FK-506 proceeds via 13-demethylation followed
by cyclization to the inactive M-I, which represents about 90% of the metabolic products
after a 10 minute incubation with liver microsomes. Analogs of tacrolimus that do not
possess a C-13 methoxy group would not be susceptible to the first and most important
biotransformation in the destructive metabolism of tacrolimus (i.e., cyclization of
13-hydroxy to C-10). Thus, a 13-desmethoxy analog of FK-506 should have a longer
half-life in the body than does FK-506. The C-13 methoxy group is believed not to be
required for binding to FKBP or calcineurin. The C-13 methoxy is not present at the
corresponding position of rapamycin, which binds to FKBP with affinity equal to that of
tacrolimus. Also, analysis of the 3-dimensional structure of the FKBP-tacrolimus-
calcineurin complex shows that the C-13 methoxy has no interaction with FKBP and
only a minor interaction with calcineurin. The present invention provides
C-13-desmethoxy analogs of FK-506 and FK-520, as well as the recombinant genes that
encode the PKS enzymes that catalyze their synthesis and host cells that produce the
compounds.
These compounds exhibit, relative to their naturally occurring counterparts,
prolonged immunosuppressive action in vivo, thereby allowing a lower dosage and/or
reduced frequency of administration. Dosing is more predictable, because the variability
in FK-506 dosage is largely due to variation of metabolism rate. FK-506 levels in blood
can vary widely depending on interactions with drugs that induce or inhibit cytochrome
P450 3A (summarized in USP Drug Information for the Health Care Professional). Of
particular importance are the numerous drugs that inhibit or compete for CYP 3A,
because they increase FK-506 blood levels and lead to toxicity (Prograf package insert,
Fujisawa US, Rev 4/97, Rec 6/97). Also important are the drugs that induce P450 3A
(e.g., dexamethasone), because they decrease FK-506 blood levels and reduce efficacy.
Because the major site of CYP 3A action on FK-506 is removed in the analogs provided
by the present invention, those analogs are not as susceptible to drug interactions as the
naturally occurring compounds.

Hyperglycemia, nephrotoxicity, and neurotoxicity are the most significant
adverse effects resulting from the use of FK-506 and are believed to be similar for
FK-520. Because these effects appear to occur primarily by the same mechanism as the
immunosuppressive action (i.e., FKBP-calcineurin interaction), the intrinsic toxicity of
the desmethoxy analogs may be similar to that of FK-506. However, toxicity of FK-506 is
dose related and correlates with high blood levels of the drug (Prograf package insert,
Fujisawa US, Rev 4/97, Rec 6/97). Because the levels of the compounds provided by
the present invention should be more controllable, the incidence of toxicity should be
significantly decreased with the 13-desmethoxy analogs. Some reports show that certain
FK-506 metabolites are more toxic than FK-506 itself, and this provides an additional
reason to expect that a CYP 3A resistant analog can have lower toxicity and a higher
therapeutic index.

Thus, the present invention provides novel compounds related in structure to
FK-506 and FK-520 but with improved properties. The invention also provides methods for
making these compounds by fermentation of recombinant host cells, as well as the
recombinant host cells, the recombinant vectors in those host cells, and the recombinant
proteins encoded by those vectors. The present invention also provides other valuable
materials useful in the construction of these recombinant vectors that have many other
important applications as well. In particular, the present invention provides the FK-520
PKS genes, as well as certain genes involved in the biosynthesis of FK-520 in
recombinant form.
FK-520 is produced at relatively low levels in the naturally occurring cells,
Streptomyces hygroscopicus var. ascomyceticus, in which it was first identified. Thus,
another benefit provided by the recombinant FK-520 PKS and related genes of the
present invention is the ability to produce FK-520 in greater quantities in the
recombinant host cells provided by the invention. The invention also provides methods
for making novel FK-520 analogs, in addition to the desmethoxy analogs described
above, and derivatives in recombinant host cells of any origin.

The biosynthesis of FK-520 involves the action of several enzymes. The FK-520
PKS enzyme, which is composed of the fkbA, fkbB, fkbC, and fkbP gene products,
synthesizes the core structure of the molecule. There is also a hydroxylation at C-9,
mediated by the P450 hydroxylase that is the fkbD gene product; the resulting hydroxyl
is oxidized by the fkbO gene product to form a keto group at C-9. There is also a
methylation at C-31 that is mediated by an O-methyltransferase that is the fkbM gene
product. There are also methylations at the C-13 and C-15 positions by a
methyltransferase believed to be encoded by the fkbG gene; this methyltransferase may
act on the hydroxymalonyl CoA substrates prior to binding of the substrate to the AT
domains of the PKS during polyketide synthesis. The present invention provides the
genes encoding these enzymes in recombinant form. The invention also provides the
genes encoding the enzymes involved in ethylmalonyl CoA and 2-hydroxymalonyl CoA
biosynthesis in recombinant form. Moreover, the invention provides Streptomyces
hygroscopicus var. ascomyceticus recombinant host cells lacking one or more of these
genes that are useful in the production of useful compounds.

The cells are useful in production in a variety of ways. First, certain cells make a
useful FK-520-related compound merely as a result of inactivation of one or more of the
FK-520 biosynthesis genes. Thus, by inactivating the C-31 O-methyltransferase gene in
Streptomyces hygroscopicus var. ascomyceticus, one creates a host cell that makes a
desmethyl (at C-31) derivative of FK-520. Second, other cells of the invention are unable
to make FK-520 or FK-520 related compounds due to an inactivation of one or more of
the PKS genes. These cells are useful in the production of other polyketides produced by
PKS enzymes that are encoded on recombinant expression vectors and introduced into
the host cell.

Moreover, if only one PKS gene is inactivated, the ability to produce FK-520 or
an FK-520 derivative compound is restored by introduction of a recombinant expression
vector that contains the functional gene in a modified or unmodified form. The
introduced gene produces a gene product that, together with the other endogenous and
functional gene products, produces the desired compound. This methodology enables
one to produce FK-520 derivative compounds without requiring that all of the genes for
the PKS enzyme be present on one or more expression vectors. Additional applications
and benefits of such cells and methodology will be readily apparent to those of skill in
the art after consideration of how the recombinant genes were isolated and employed in
the construction of the compounds of the invention.

The FK-520 biosynthetic genes were isolated by the following procedure.
Genomic DNA was isolated from Streptomyces hygroscopicus var. ascomyceticus
(ATCC 14891) using the lysozyme/proteinase K protocol described in Genetic
Manipulation of Streptomyces - A Laboratory Manual (Hopwood et al., 1986). The
average size of the DNA was estimated to be between 80 and 120 kb by electrophoresis on
0.3% agarose gels. A library was constructed in the SuperCos™ vector according to the
manufacturer's instructions and with the reagents provided in the commercially available
kit (Stratagene). Briefly, 100 µg of genomic DNA was partially digested with 4 units of
Sau3A I for 20 min. in a reaction volume of 1 mL, and the fragments were
dephosphorylated and ligated to SuperCos vector arms. The ligated DNA was packaged
and used to infect log-stage XL1-Blue MR cells. A library of about 10,000 independent
cosmid clones was obtained.
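Whether a library of this size is likely to contain any given region of the genome can be estimated with the standard Clarke-Carbon calculation, sketched below. The insert size (40 kb) and genome size (8 Mb) are assumptions chosen for illustration; neither value is stated in this specification, and the function names are hypothetical.

```python
import math


def fold_coverage(n_clones, insert_bp, genome_bp):
    """Average number of times each base is represented in the library."""
    return n_clones * insert_bp / genome_bp


def probability_represented(n_clones, insert_bp, genome_bp):
    """Clarke-Carbon estimate: P = 1 - (1 - f)**N, with f = insert/genome."""
    f = insert_bp / genome_bp
    return 1.0 - (1.0 - f) ** n_clones


def clones_needed(probability, insert_bp, genome_bp):
    """Clones required so a given locus is present with the given probability."""
    f = insert_bp / genome_bp
    return math.ceil(math.log(1.0 - probability) / math.log(1.0 - f))


ASSUMED_INSERT_BP = 40_000      # assumption: typical cosmid insert size
ASSUMED_GENOME_BP = 8_000_000   # assumption: approximate genome size
LIBRARY_CLONES = 10_000         # "about 10,000" clones, from the text

print(fold_coverage(LIBRARY_CLONES, ASSUMED_INSERT_BP, ASSUMED_GENOME_BP))
print(probability_represented(LIBRARY_CLONES, ASSUMED_INSERT_BP, ASSUMED_GENOME_BP))
print(clones_needed(0.99, ASSUMED_INSERT_BP, ASSUMED_GENOME_BP))
```

Under these assumed values the library is many-fold redundant, which is consistent with the recovery of several overlapping cosmids from a single probe.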

Based on recently published sequence from the FK-506 cluster (Motamedi and
Shafiee, 1998, Eur. J. Biochem. 256: 528), a probe for the fkbO gene was isolated from
ATCC 14891 using PCR with degenerate primers. With this probe, a cosmid designated
pKOS034-124 was isolated from the library. With probes made from the ends of cosmid
pKOS034-124, an additional cosmid designated pKOS034-120 was isolated. These
cosmids (pKOS034-124 and pKOS034-120) were shown to contain DNA inserts that
overlap with one another. Initial sequence data from these two cosmids generated
sequences similar to sequences from the FK-506 and rapamycin clusters, indicating that
the inserts were from the FK-520 PKS gene cluster. Two EcoRI fragments were
subcloned from cosmids pKOS034-124 and pKOS034-120. These subclones were used
to prepare shotgun libraries by partial digestion with Sau3AI, gel purification of
fragments between 1.5 kb and 3 kb in size, and ligation into the pLitmus28 vector (New
England Biolabs). These libraries were sequenced using dye terminators on a Beckman
CEQ2000 capillary electrophoresis sequencer, according to the manufacturer's protocols.

To obtain cosmids containing sequence on the left and right sides of the
sequenced region described above, a new cosmid library of ATCC 14891 DNA was
prepared essentially as described above. This new library was screened with a new fkbM
probe isolated using DNA from ATCC 14891. A probe representing the fkbP gene at the
end of cosmid pKOS034-124 was also used. Several additional cosmids to the right of
the previously sequenced region were identified. Cosmids pKOS065-C31 and
pKOS065-C3 were identified and then mapped with restriction enzymes. Initial sequences
from these cosmids were consistent with the expected organization of the cluster in this
region. More extensive sequencing showed that both cosmids contained, in addition to the
desired sequences, other sequences not contiguous to the desired sequences on the host
cell chromosomal DNA. Probing of additional cosmid libraries identified two additional
cosmids, pKOS065-M27 and pKOS065-M21, that contained the desired sequences in a
contiguous segment of chromosomal DNA. Cosmids pKOS034-124, pKOS034-120,
pKOS065-M27, and pKOS065-M21 have been deposited with the American Type
Culture Collection, Manassas, VA, USA. The complete nucleotide sequence of the
coding sequences of the genes that encode the proteins of the FK-520 PKS is shown
below but can also be determined from the cosmids of the invention deposited with the
ATCC using standard methodology.

Referring to Figures 1 and 3, the FK-520 PKS gene cluster is composed of four
open reading frames designated fkbB, fkbC, fkbA, and fkbP. The fkbB open reading frame
encodes the loading module and the first four extender modules of the PKS. The fkbC
open reading frame encodes extender modules five and six of the PKS. The fkbA open
reading frame encodes extender modules seven, eight, nine, and ten of the PKS. The
fkbP open reading frame encodes the NRPS of the PKS. Each of these genes can be
isolated from the cosmids of the invention described above. The DNA sequences of these
genes are provided below preceded by the following table identifying the start and stop
codons of the open reading frames of each gene and the modules and domains contained
therein.






Nucleotides                     Gene or Domain

complement (412 - 1836)         fkbW
complement (2020 - 3579)        fkbV
complement (3969 - 4496)        fkbR2
complement (4595 - 5488)        fkbR1
5601 - 6818                     fkbE
6808 - 8052                     fkbF
8156 - 8824                     fkbG
complement (9122 - 9883)        fkbH
complement (9894 - 10994)       fkbI
complement (10987 - 11247)      fkbJ
complement (11244 - 12092)      fkbK
complement (12113 - 13150)      fkbL
complement (13212 - 23988)      fkbC
complement (23992 - 46573)      fkbB
46754 - 47788                   fkbO
47785 - 52272                   fkbP
52275 - 71465                   fkbA
71462 - 72628                   fkbD
72625 - 73407                   fkbM
complement (73460 - 76202)      fkbN
complement (76336 - 77080)      fkbO
complement (77076 - 77535)      fkbS
complement (44974 - 46573)      CoA ligase of loading domain
complement (43777 - 44629)      ER of loading domain
complement (43144 - 43660)      ACP of loading domain
complement (41842 - 43093)      KS of extender module 1 (KS1)
complement (40609 - 41842)      AT1
complement (39442 - 40609)      DH1
complement (38677 - 39307)      KR1
complement (38371 - 38581)      ACP1
complement (37145 - 38296)      KS2
complement (35749 - 37144)
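As an illustration of how the coordinates in the table above can be read, the following sketch computes feature lengths and overlaps and extracts a feature from a sequence, taking the reverse complement for entries marked "complement". The coordinates are copied from the table; the Feature type, the function names, and the toy sequence in the usage lines are hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
from typing import NamedTuple


class Feature(NamedTuple):
    name: str
    start: int        # 1-based, inclusive (assumed convention)
    end: int          # 1-based, inclusive (assumed convention)
    complement: bool  # True when the table lists the range as "complement"


# A few rows copied from the table above.
FEATURES = {
    "fkbW": Feature("fkbW", 412, 1836, True),
    "fkbE": Feature("fkbE", 5601, 6818, False),
    "fkbF": Feature("fkbF", 6808, 8052, False),
    "fkbP": Feature("fkbP", 47785, 52272, False),
    "fkbA": Feature("fkbA", 52275, 71465, False),
}

_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def length(feature):
    """Number of nucleotides spanned by the feature."""
    return feature.end - feature.start + 1


def overlap(a, b):
    """Number of nucleotides shared by two features (0 if they are disjoint)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


def extract(sequence, feature):
    """Return the feature's sequence, reverse-complemented when required."""
    region = sequence[feature.start - 1:feature.end]
    return region.translate(_COMPLEMENT)[::-1] if feature.complement else region


print(length(FEATURES["fkbA"]), overlap(FEATURES["fkbE"], FEATURES["fkbF"]))
# Toy sequence, not taken from the listing below.
print(extract("AACGT", Feature("toy", 2, 4, True)))
```

The short overlap between adjacent open reading frames such as fkbE and fkbF is read directly from the listed start and stop positions.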
1. 777ACGGA7C A7777AGTCA G7GG7GA77G 723;_73302.3 77A773GGAA TAAAGGGCGG

in o'tacaagatc c7CACattgg gcgaccgcca 377^.37333 i agtt'gcctca gaggcaaacc 

15 3A73AGGGTG7 GGGGGGTGGG 7377'7, ..: o .. . 7._:.7.v. 1 ..: - o.-.GTGGC GCACCCGCGG 

Gil A77G77ACC7 7T7TCCCCCG GGGG7GGGA7 J 7 J 7 ., : 1 : 7 A7GG77GG GCTCTC: ICG 

/, .. 1 .". , wov.7'o:7 , 7A CGGGCGCGGT GIGGCuTCGG — ... - jj^nTCuG'^ CG^oTGACGG 

361 7A7GGGGAGG GGG7ACGGCG GCCG7GGC7C G7G777A77G 333773GGCG GTCATCCGTC 

■i ^ . .j:.". G AGGoC.'-'vC TCGGC^A.GC A G'..ujACvjC o ? 1 ^ j ^ - ,. ... . 'jjo^lGoA C^riCCGTGTG 

— 0 I 3_ ^ _ 7 ■„ 'o 7GGG ... o ; ^oCoG TGGC 3 o .j I-jGG^A ^ „■ _ i - - j^G'jT A.oGCTGAG C G 

."_ -1 - j. ;;:\Cn^oc .-'.G ~7_A.~. Gvj CC G^H^T^^GT-^ ^ .r . - . ... ... -J A. :j G ^GiCGulGTG 

c ._ 7 77 7 2 77 77 TCGATG GGG 7 jT.^jCj-j . . _? .. _ : .: — .j3..G_C oG.^CATACov- 

r , ^ _ G 7 0; 7 A. 7 A 7 G 7 7 7 G A G 7 7 7 G G 1777 G 7 A G G 1 A 1 7 ■. - 3 A 7 . 1 ; % - .-.3 3 G 7 7 T G G A 7 G G 7 G A. T 

'7 7A.GCGG777G CCGA.TA.CGAC CGGTC^ACGC G^-.7~7^.3-_ - _r o 7CGCGT GGA.CGC7GGA. 

25 '7. G17AGCGGGTG GGG7AG7CG7 AGTCGGCA7C GCAG777GJ..: AC7G7CCCCG GGGCGCAATA 

84 1 7GGTG IGCCG GGT7777777 7CG7A7GGAA G7733GG733 AA177777G7 GGTAGACGCG 

901 77GGGTGAGA 77 77 AG 7 AG A CCTCGTGGTG GTA3GG73A3 AAGAA777GG AGTCGGCGGG 

961 GAA7C7GGCG CG GAG C AG GG CCTCGGGCG7 CTGGC7GG77 GCGGGGCCGC G7GCCGGGTA 

177 1 GGTGGGGTAG 7CG7GGAGGG CGGCGGGCAG GAAGGTGAAG AGG7TGGGAC CGTCCGCGCG 

30 138 1 77A7AGGGTG C777C77AG7 CGACTCCTCC GT7G7A7A77 77GGGATGGT TCTCCAGCTG 

114* 77AG7G7A7G AGG7AGCCG7 7GT7GGACAT 7CCGG7GA77 AGGG7GCGCT CGAGCGGCCG 

1201 G7GGTAGGGC TGGGGGAGGG ACGCGGGGG7 GG7C7GGG77 AG7TGGGTGA GGGGGGTGTT 

1261 77 AG 7 CGGGG A7GG7GTCGC C7GGCCGGGA GCCA7CA2G3 7 AG AJACGCGG GGCCGGTGTT 

1321 G7C7TTGTCG GTGGCGGCGT AGGGGTAAG7 G7GGG7GA77 A777A277GG 7GATGGCCCG 

35 138 1 G77G77GG7G 7A7TG7TGG7 G G TT AC C GGG G77G77G377 A7GAG7AGGG CA.CCGTTCCA 

14 4 1 G7GGTCGGGC AG7GGGATGA CGAAC7GGGC G77G77G777 2A77GGTGGT 7GGTGTTGGT 

1301 G3GTGGAGGTG TCGGGGAAGT A3CCG7CGAT 77GG.-.7777 GGCA.CTCCGG TGGGAG7GGC 

1301 7AGG7TGT7G GG7GT7AG77 77G77CAG77 7G77G3G77G 77 37GGCGGG TGGCCGCCGT 

1021 777CG77G7G G77A.GCTCGT 77A.GGG.7G77 37777 3 317.-". 7^7G73GCCG CGGGGAGACG 

40 :63 1 7AG77GGGA7 AGA7GGG7G7 AG7GAGCG77 7GGG33A733 GGAGCAGGCC GGGGGGTGGG 

17.11 7GGTG AG GGG AGCAGGAGGG 7GACTGGGG7 7A7GG7GA7A 37377GAGGC CGGTGCGTCT 

1301 TC7CGGGGCC 7G77CGACA7 7GAGGGGGAG AA77A773A3 AG7777CAGA CGTGCGGATG 

13 61 GA7GA7GGAG TGGAGG7TAG G77GCGCA7G G7G3AGA33A A7ATGGGTGG GCCCGCGATG 

1921 AC7GAGG7CC CTCAGAGGTG GGCCGCCG77 A7GA7GG3GG 7GGGACCGCG GGCGCTCCGG 

45 1931 GGGGG:GGCG GCGGGGGCCA CCGGTTCCGG GT77G7GGGT 7AGGGACAGG TGTCGTTCGC 

204 1 GACGGTGAAG TAGCCGG7GG GCGACTCTT? 7AAGG7GG77 G7GACGAAGG TGTTGTACAG 

2101 GCCCATGTTC TGGCCGGAG7 CG77GGCGTA CG7GTAA77G GGGG7GGTCG "TGGGGCGGCG 

2 161 GG7CTGGAGG 7GAGCGTAG7 TGCCGGCGGT CCAG7AGA07 GCGGTGGCAC CGGTCGTCTG 

2221 7GCGGTGAC7 GGGCGGGAGA GGGGTCCGGG 277GG7G737 GGGTCCCGGG CGGCGACCGC 

50 22 6' G7AGG7GTGC GA7GTGCCCG CCC7GAGGCC GGTG7 77G7G 7ACGACGTCG TGGCGGAGGT 

2311 GG7CATCTGG GGAGGGTCGG GG7GGACGG7 3TAG7GG377 GCGCCGTCGA CGGGTTTCCA 

2101 GG7GAGGG7G A7GGTGG7GT CGGTGGCGCC GG7G73GG77 AGG77GGACG GAGCGGGCAG 

2161 GG^J\GGGGGG TCGGAGGCGG ATCCGGTGAG GC2GAAG.A'.7 7G7GTGATCC AGTAGCTGGA 

252 1 A7A.GA.T7GAG TCC.AGGAAGT AGGCGGCG77 GG7G77G77.; 7-7TG77GTG CTGCGGi GGG 

55 2331 GGGATCGAGC GGGGTGCCGT GCCCG AT GCC CGGCACC7G3 7T7ACCTCCA CGGCCACCGA 

264 1 TCCGTCCGGG GCCAGGTAG7 CCTCGTGCCG GG7GGAG777 GGGGGGATCA CCGAGGTACG 

2~T2 GTCCGGCGTC TGGGACACGC CGTGCACAGC GG777A7737 TGGGGGAAGT GGTCGGCGTT 

2 7 61 GCGCGGCGCG ACGGTGGTGT CC77GTCG3C G TGGC AG AT G GGGAGGCGCG GCCACGGGGC 

2521 7GACCACGAG GGGTAGGGG7 CACGGACCCG CCG7G777A7 7GG7GCGCGG TCAGGTCGGT 

60 233 1 GGGGGGGT7C ATGGAGAGGT ACGCGCTGCT GA7GT7GG7G GCA7AGCCGA AGGGCAGGCC 

2 94 1 GGGGACGAGC GCGCCGGCCT GGAAGACGTC GGG A7AGG7G GGG AG CATC A. CCGACGTCAT 
GGG7CGGGGC CGTAGGCGGA
GCCTGGCCCC TGCGGTTGTC 
GAGGACGTGG TCTCGGCGAA 
JGGGAG7TG7 GGGCGTAGCC 
GCCGGCTCCG CGGGCAGGGA 

v, ( ^ ^ <^ lo G 1 

GGGCGGTGGG 
A3GGGCAG3A GACGGG^GAG 
G 3GG.AGGr\GG AGAGGGGGAA 
3GGGGTGGG7 CGATGTCGTG 
GGGGGGGGTG GGCGGCAGCG 
A '2 AC G C G G G A C C G C G C G A T G 
GCGGACGCG.-s CGAlACGCCCC 
G7GGGGCTTG CCTGCCCTGG 
3o'o7aTGGCu oGCGAGGACG 
"... CG'i'G^C-^ G ^CGGGPiGGGC 
2G7CCCAGAC GGGTTCCACC 
37GTGCGGTC GAAGGTGATG 
7GA7GA7GCG TACGTCAGGT 
AGTAGATGGT GTCGCCCTCG 
7GGGGA7GAC ATGCTGGGAG 
AGGTGGAG7C CACCAGCGGC 
2-GAGCGGCGC GGTGT7CTGC 

. j7; jC^C: : . j 'J'JoGTvjGCGG 

4 4 4 1 32AGA.CGTCGC GGGTGGTG.AA G7GC7GG.".A.G 7.^.GCGGG-/-T 2:^..G^.GGC^7l 'oACCAGAGCG 
4 331 G7GCGG G7GG GG7CGTGGTG 3GG3TTC7GA G72G7GATG3 2GG7GAT7G7 GGGAAGTCCC 
4 561 GGGGGGGGTG TGAAATGCGG AACGTTCACC GGGC7GA7A.C GTGGGGGGCA TGAGCC^TGG 
4 621 ACCGTACGTA G7CGTAGAAG CTGGCGAGCA G7GGGGCGGG TGGTCC7CGG GGGAGTGTGA 
4 681 CCACGGCGAG GGTGCGCGGG GGGTGGGGGT CG7CGAGCGG GACGGGGACG GCGTGGTCAC 
474 1 GGGGCCGGGA CGGGCTGCCG G7GAGGGGGG CGAGGGGGAG ACGGAGGCCG GCGGGGACCA 
4 801 GGGCCCGCAG CG7GCTCAGC TGGGTGCTC7 C GAG GAG GAG GGGGGGCACG AATCCGGCCG 
4 8 61 CGGCGCACAG CGGGTGGGTG ATGTGGCGCA GTCCGAAGAC CGGCTCCAGT GCCACGAACG 
4 921 CCTCATCGGC CAGCTCCGCG G7CCGCACCC GGCGGCGTCT GGCCAGCGGG TGTCCGGGTG 

4 981 GGACGAGGAG GCACAGTGCC TCGTCCCGCA G7GG7GTCCA C7CGAGATCG TCCCCGGCGG 
504 1 GTCGTGGGCT GGTCAGCCCC AGGTCCAGCC 7GG7GTTGGG GACGTCGTCG ACCACGGCGT 
5101 GGGGGGGGTG GCCGCGCAGT 7GGAAGGTGG 7GGGGGGAGC CAGCCGGGGG TACCCGGCGA 
5161 GGAGG7CGGG CACCAGCCAG GTGGCGTAGG AGTGCAGGAA ACCCAGTGGC ACGGTGCCGG 
32 21 TGTCGGGGTC GATCAGGGCG G7GA7GCGG7 GCTCGGCGCC GGAGACCTCA CTGATCGCGC 
5281 GCAGGGGGTG GGCGCGG:~AG ACC7CGCCGT ACT 7GTTGAG CCGGAGGGGG TTCTGGTGCC 
234 1 GGTCGAACAG CGGCACGCCC AGTGGTGGG7 GGAGGGGGCG GATGGCCCTG GACAGGGTCG 
54 01 GC7GGGAGAT GTTGAGCCG7 TGGGCGGTGA TCGTGACGTG C7CGTGC7GG GCCfJkGGCCG 

5 4 61 TGAACCACTG CAAC7CCCGT ATCTCCATGG AGGGACTATA CGTACCGGGC ATGGTCCTGG 
552 1 GGAGGTTTGG TCATTTCAGA GGGGCCGGGG GGGGGC7GAG AGTGAGTCCT CACCAACCAG 
5531 GACCCGATGG GAGGGACCCC ATGTGGGAGC CGCATCCTCG 5GCTGAAGAG GAACGCCCCG 
564 1 GCGGGCCCGT GTGGGGTCTG C7GGTGGTTT C7T7GGAGGA GGGCGTCGCC GGTCCGTTCG 
5701 CCACCCGCCA CC7GGCCGAC G7GGGCGGGG G7G7CATCAA GA.TCGAACGC GGCGGCAGCG 
57 61 GCGACCTCGG CCGCGGCTAC GACCGCACGG TGGGTGGCAT GTCCAGCCAC TTCGTCTGGC 
53 21 TGAACGGGGG GAAGGAGAGC GTCCAGCTCG ATGTGCGCTC GCCGGAGGGC AACCGGCACC 
58 SI TG^ACGCCTT GGTGGACCGG GCCGATG^CC TGGTGCAGAA TCTGGCACCC GGCGCCGCGG 
594 1 GCCGCCTGGC ATCGGCCACC AGGTCCTCGC GCGGAGCCAG CGAGGCTGAT CACCTGCGGA 
6001 CATATCCGGC TACGGCAGTA CCGGCTGGTA CCGCGGACCG CAAGGCGTAC GACCTCCTGG 
60 61 TCCAGTGCGA AGCGGGGGTG GTCTCCA7CA CCGGCACCCC GGAGACCCCG TGCAAGGTGG 
6121 GCCTGTCCAT CGCGGACATC 7G7GCGGGGA 7GTACGCG7A CTCCGGCATC CTCACGGCCC 
618 1 TGGTGAAGGG GGCCCGCAGG GGCCGGGGCT CGCAGTTGGA GGTCTCGATG CTCGAAGCCC 
624 1 TCGGTGAATG GATGGGATAG GCGGAGTACT AGACGCGGTA CGGCGGCACC GCTCCGGCCC 
6301 GCGCCGGCGC CAGCCACGCG ACGATCGCCC CGTAGGGCCC GTTCACCACG CGCGACGGGC 
6361 AGACGATCAA TCTCGGGCTC CAGAACGAGC GGGAGTGGGC TTCCTTCTGC GGTGTCGTGC 
64 21 TACAACGCCC CGGTCTCTGC GACGACCCGC GCTTTTCCGG C.AACGCCGAC CGGGTGGCGC 
64 SI ACCGCAGCGA GCTCGACGCC G7GGTGAGGG AGGTGACGGG CACGCTCAGC GGCGAGGAAC 
654 1 TGGTGGCGCG GCTGGAGGAG GCGTCGATCG CCTACGCACG CGAGCGCACC GTGCGGGAGT 

6 601 TCAGCGAACA CCCCGAA.CTG CGTGACCGTG GAGGGTGGGG TCCGTTCGAC AGCCCGGTCG 
6661 GTGCGCTGGA GGGCCTGATC CCCCCGGTCA CCTTCCACGG CGAGCACCCG CGGCGGCTGG 
67 21 GCCGGGTCCC GGAGCTGGGC GAG CAT AC GG AGTCCGTCCT GGCGTGGCTG GCCGCGCCCC 
67 81 ACAGCGCCGA CCGCGAAGAG GCCGGCCATG CCGAATGAAC TCACCGGAGT CCTGATCCTG 
13951 G77CGCC3GC GGG7AG77GC CCTGACCGGG GGTGCGGAGG
1-3041 GACGACGAA7 GGGGGGAGGT CGGTGTCGCG GGTGAGCGGG 
14101 GGCCTTGGGT TTGAGGACGG TGTCGATGCG GTCGGGGGTG AGGTTGTCGA GCAGGGCGTC 
14161 GTCGAGGGT7 CCGGGGGTGT GGAAGACGGC GGTGAGGGGT TGAGGGATGT GGGCGAGGGT 

60 14 221 GGTGGCGAGT TGG'TGGGGGT CGCCGACGTC GCAGGGGAGG 7GGGTGCGGG GGGTGGTGTC 
14231 GGGGGG7GGG G GGG GGG AG A GGAGGTAGGT G7GGGGG7GG T7CAGGTGGC GGGCGAGGAT 
1 4 34 1 GCCGGCGAGC- GTGCGGGAGC CGCCGGTGAT GACGAGGGGC CCCTCGGGGT CCAGCGGCCG 
14401 GGGGACCGGG AGGAGGATCT TGCCGGTGTG CTCGCCGCGG CTCATGGTCG CCAGCGCCTC 
144 61 GGGGACCGGG CG CAT GTCGT GCACCGTCAG CGGCAGCGGG TGCAGCACAC CGCGCGCGAA 
^rt.GGCGCGoG 


GGA7CCACG7 


4 3 98 1 


C -o G C C AG G T C 


GAACGGCTGC 


7oGoCGoCG7 


GGCGGrt id w 


GGTCCTGCCC 


ATCTCGACGA 


4 4 0 4 1 


ACCGCCCGCC 


CGGTGCGAGC 


A. G C C G A T G G 


ACGCG7CGAG 


GAGTTCACCG 


GTGAGCGAGT 


4 4 101 


7 GAG C AC G AC 


GTCGACCGGC 


'oourAGu TG - 


CGGCGAACGC 


GG CGCTGCGG 


GAGTTCGCCA 


4 4 161 


CA7GGTCGGT 


GTCGAAGCCG 




GCAGG7GT7G 


7TTGGCGGGA 


CTGGCGGTGG 


4 4 22 1 


CG7ACACCTC 


G o C G C Co AG 


i. ^'ov-GGGCG.A 


TCCGGGTCGC 


CGCCATGCCG 


ACACCGCCCG 


4 4 2 3 1 


i l i C G G C G T G 


GACCAGGACC 


„ 7 v j. oGCCuo 


G 7 C 0 C AG C T C 


GCCCGCGTCG 


ACGAGGCCGT 


4 4 3 4 1 


ACCAGGCGGT 


GGCGAACACG 


_ 'jGLjCnL.Cw 


ACGCGGCGA7 


GGGGAACGAC 


CATCCCCGTG 


4 4 4 0 1 


uoAi CoGTGC 


GACCAGCCGC 


^'vdGTCCGCGA 


C C A.C G G T G C G 


CCGGAACGCG 


TCCTGCACGA 


4 4 4 61 


GACCGAACAC 


G CG G 7 CGCCG 


G'ju^j^C . m .G '■ j I 


CG7CGACGCC 


GGGTCCGACT 


TCGGTCACGA 


4 4 5 2 1 


7GCCCGCGGC 


C7CCCCGCCC 


ATCTCGCCCT 


CGC^CGG-j * t\ 


GGTGCCGAGC 


GCGATCAGCA 


4 4 55 1 


^.uTv-GLiju.' uA 


GTTCAGCCCC 


o C G 'o C o G ^ 


C G7CG.A7GCG 


GACCT CGCCG 


GCGGCCA.GGG 


4 4 64 1 


o o bCGuC'ouG 


ACGTCGAGCG 


O O ^ ^_ O 


G 1 J> O i G 


AGCGTTCCGG 


AGGCGGGCGG 


4 470 1 


GCGCAGCGCC 


CAC7GGCGCG 


o i CuuCAGu'j 


GGG*Goi'oiC 


CGCGCGTACC 


A.'oCCGGojoCA 


4 47 61 


CG7AGGCCAC 


GCCGGCCCGC 


AGCGCGA7C7 


GGGGTTCGiwG 


GAGCGAGGGC 


GCGGCGGGGA. 


4 482 1 


CGAGGTCGTC 


ATCGCCGTCC 


GTGTCCACCA 


GCACGAACGA 


7CCGGGTTCG 


GCGGCCTGGC 


44 88 1 


GGCGCAGCGC 


CTCGTCCCAG 


A 'o >t> C G G G C C 7 


GG7CCGCG7C 


CGGG AiTCTCG 


GCCGGGCCGA 


4 4 9 4 1 


^Gl*v OACCG'w 


ooGGCtoGG i 


■'"Vv'GAk^Cvjj i i^C 


GGCG^GGTvjA 


CGGGGTGCCG 


GGCAGGTCGC 


4 5001 
45061 


GCCGCTCCCA 


GACCAGTTCG 

CGCGAGCCGC 


CAiCAGCGTGG 


CCTCGCCACT 
CC77GCCCGA 


GCCGGTGGCG 
CGCGGTGCGG 


ACCAGATGGG 
GGGATCGTGG 


o o o o o hG C u C 


u^tjv.oCTbbA 


4 5121 


TGACGTGCCA 


GATCTCGTCG 


'o G C A C C T 7 o A. 


AG TAG G C G A.G 


CCGGCGGCGG 


CACTCGGCGA 


45181 


GGATCGCCTC 


GGCGoGG ACo- 


^wGGGGGCCvjT 


CGGAAACGAC 


GTAGAGCACG 


GGTATGTCGC 
4 5 2 4 9 


CGAGGACGGG 


GTGCGGGCGo 


CCCGCCGC 


GG 


CGGCGTCCCG 


GACACCGGCC 


ACCTCCTGGG 


4 - 3 0 1 


' - j."iCuG - 7 ^ 


GATGTGCGGG 


GG GTG9;A9 






o ^ o Gr^T G A T 


AGCTCCTTGA 




9 G G G G G C G G 9 


GAGGGTCAGG 


9 1 1 w o o - 




G o G C G 9' G A G G 


. ^'wG.-iGGTCG 


CCGGTGCGGT 


-3 : '> -l 9 1 


AG GAG CCGTC 


--rvG 'o AGG ~9. ^ 


9 ' j: ^ : _ G - ~ 


CG 


- j; ^ 


^ GG i9' j .G 


GGGAGCATGA 


■1 5 4 9 1 


9GGTGGGGGG 


G-> G 9 G G G C G ' 






- — , v o ■„ ^; '.; 




G^ 'oG GGG AG A 


■! - ^ - 


. j .j w . ™ . - 


-j 9 ^ G .* \ G 


jr."" '^j 1 ^ '~ 




^: ^, o J 'j; 


^. w'jCACGAvj 


CCGGGAACCC 


4 9 6 1 * 


. . - 


... i j „: _ o - - J 


■ '.j \z - 9: A G 


■ ^ -j 


A G C G G G 9 G 9 9 


. . ^ G T G G » u 


CCGTACGTGT 


4 5 6 61 


^ 1 o ^ A j G 'o 


- o GG^ . , -_ , - t - G 


G7CGCC7G 




9^GTGGGG999G 


^ -.GGGAC GC G 


GGCGAGGTGG 


. r -i - 


.* . : v. -_ w i j G . ' „ 


CAGGGCCACG 




o ^ 


G A G C G C G G G G 


GG GGGGGGAG 


AGGGGGCCGA 


•1 5 " 2 1 


l . r. .~ :-. :j) G 9 -'A ^ ; G G 


G T A G A T G G G G 


GGCACGCG 




'jAo : -:'.C G j 


.; . .'oG A G T G G 


TCGGCCAGGG 


4 5 8 A 1 


- ^ i _ ^ 




"/-G.^-Aj ... 


... 




J oAC : .:C-j 


GGGACCGTGA 


4 5 9 9 1 


'joA'v.o j G lj «n o 


G A G G G A G A G G 


GAGGl'GGCG 


_j :\ 




. j G o vj »o G G 'o 


GGCCAGAGCA 


-j y u - 


G . TCGTC ^TC 


CTCGGTCAGC 


G GG'w A'oo A 




GGAGGTGGGA 


GTGCATCGCG 


GAGCACAGGC 


■1 692 1 


CGGTGCGC7G 


TGGGGAAA GG 


AGGGGGTT 




o.-, j j: : 


j ^ 'o ■„ oGA'j 


GTGTAGAGCA 


c9<- : 






^ ^ ^ J r\ O j T 


j . 


- 1 J _ ^ j ... 




TGGGTCCCGG 


4 6 1 4 ; 


GGAGG7CCTC 


G TAG G A G A. G G 


^ T 1 ^ " 

. . _ o ^ 




■ _ ^. ^ U ^. J _ _ ^. 


GAGGAGGAGG 


AGGGTGGCGT 


■16:9: 


9G 2TGCCGG7 












r\'„u\jT ( ^oGG^ 



A6321 GGAGGGGGGG GGGG 
— 0 -:0.ic--. 'jC^.'jCAICG 1 ^ GAC ; ^GGGT'^.\^ G'^-ov-G'-jG \^».j>.~i '^G ^ ^ o vjrv_. ^ ... jnoGTCG GGGGCGAGCC 

4 6<H1 GGCGGGGCCG GAGGCGGAGT TGCGGGTACG GGAGGGGGGG TGGGGAvATCC GTGTAGGCGA 
4 650 9 G'GCGGTCGCC GCGTCGCTCG GCATGGATGC GGAGGAGCGG GGGCAACGGC GGGATTGGTT 
4 6569 CCACACGCGC CATGGAAAGA CC7V^C7Z r VC GACGAACGGG ACAAGAGCAC GGAACCGGCG 
4 6621 AGGAGTAGAG GCGGGGGAGG GTAGGAGGGT G7TGGGGAGG GGGAGCGGGT G^AGATCCCC 
25 4 6691 GTAGGGTGGG GGGGGTCGGG GGAGGGTGAT CTAGGGGGTT GGAGGCA7AC GGGGGTGCGT 
4 67.;. .AA77GGGTTG G7GATGAGGG ATGGGGGACG GGAGGGAA GG GTGGAGGCGT TGTCCATATC 
4 6SC9 7GTGAGGGCG GCGTA7TGCC GCTTCGAGAA GACCGGATGA GGGGAGGTGG AGGGTGACGA 
4 6661 GA GGG GGG TO GGCCTGATCG AGGAGGGGAG CGGCGACACC GACG7G7CGC 7GGTGGACGG 
4 6921 TGG7GGGCGG ACCGCCGTGC ACACCACGAG CGGTGACGAG GAGGGGTTGA CCGAGGTCTG 
30 4 695 1 GCACGCACAG CGCCCTGTCG AGTCCGGCA7 GGACAACGGC A7CGGG7GGG CGCGCACCGA 
4 704 1 GGGG7ACCTG T7CGGTGTCG TGCGCACCGG CGAGAGCGGG AGGTACGCCG ATGCCACCGC 
4 7101 GGGCG7C7AC ACGAACGTCT TCCAGCTCAC CCGC^CGC'VG GGGTATCCCC 7GC7CGCCCG 
4 7 161 GACCTGGAAC 7ACGTCAGCG GTATCAACAC GACGAACGGG GACGGGC'IGG AGGTGTACCG 
4 7221 GGACTTCTGC G7GGGCCGCG CCCAGGCGCT CGACGAGGGC GGGATCGACC CGGCCACCAT 
35 J723 1 GGGGGCGGCC ACCGGTATCG GCGCCCACGG GGGCGGCATC AGCTGCGTGT TCCTCGCCGC 
4 7 341 CGGGGGCGGA GTGCGGATCA AGATGGAGAA CGCGGGGG7G C7CACGGCCC ACCAC7ACCC 

AGCTGGCTGG GGCCGCCGGA 
G G ACACCG AA CGGTGCACCA 
ATGGCCCGGG 7GA7GGGCGC 
G7GGGCGAGG TGGACCACCT 
GGGCGGGTG7 GCGCCGCACG 
A7AGGCCGGG AGGA7CTGCT 
AGuAGGCCCGC GACGC7GCGC 
AGCGCGGCAG CCCGGTCCTT 
7GGACCCGGA GAGCAACGCC 
AGGGGCCGGG CCTGGAGCGT 
CGGTGTTCGA CACCGCCGAC 
7CGTGCGCGA CGCGCGGGCG 
7CGCCGCGCC G7TCGACCTC 
G7GACGACGA CCACGTTCTC 
7CGGGCTCG7 CCAACATGAA 
CTGCCGAACT GCCGCCG7TG 
AACTCACCGG CGCCGGACTG 
GCGCGGCGCG GC7CGCCCTC 
CGGGCATGGC CGAGTGGCGG 
GCGACTCCGG 7GCG7CCGTG 
GGCAGGCGGG CACGCGGGAC 
CG7ACGAGGG CCTGATCGGC 
GGGATCCG7C GTTCCGGGAA 
CGGACGCCGA CGTGCCGTTC 
CGGTCAACCC GGTCGTCCAG 
CGCTGGGCGG CATCGCGGCC 
TCGAATTCCA TGTG7ACGAG 
G7GCGCTG7T CGACGAGCCA 






•i '1L . 

4 7461 
4 7 521 
4 7 58 1 
47 64 j 
4 7701 
4 7 7 61 
4 782 1 
4 7 33 1 
4 7 94 1 
4800 1 
4 8061 
48121 
481S1 
4 8 2 4 1 
4 8 30 1 
48361 
4 3 4 2 1 
4 8 4 8 1 
4 854 1 
48601 
48661 
48721 
4 878 1 
4 884 1 
4 8 901 
48961 
49021 



JO'jGvJuGCGG 

CGGTGA7G7G 

0 'o A G A A C G 7 G 
CAAGGTC7AC 
GC7G7GGAGC 
GGTCGAAA7C 

1 7 C G b C G G AT 
TCGTCCTTCG 
TATA.ATCTCC 
GGGCTGGCGC 
GGCGAGCCCG 
GGCAGCGAGG 
GGCACCGGGC 
GCGGTGACCG 
CTCGCAGCCC 
CCGGTGCAGT 
G A C A G G C G T C 
CCCACCGACC 
CCGCCGGCCG 
T7GATGACCC 
GTGCTGGTCG 
ATGTTCGTCA 
C7CCTCGACC 
GAGAACGTCA 
GTGC7GTTGC 
GAACCGTTCC 
GAGCCGGG7G 



bo _ ^Lbv.ub^ 

C7GT7GATC7 
AGCGGCGAG7 
CGGCGCCACG 
GTCCGCCGCC 
ACCGCGGCCG 
GAAGGCATGG 
CCGCGAAGAG 
CACAGCGGCG 
CGCTCGTGCA 
TCGTCGTCGC 
7CCAGCGGGT 
AGGACGCCGC 
CGTTGATCAG 
7GCACCATGT 
AC7ACACGGC 
ACGCCGACTT 
TGGCCT ACTG 
G7CCCGGCCC 
CGCTGGCCAC 
TGCTGGCGGC 
GCACGCCCGT 
ACACGCTCGC 
GCTGCCGGGC 
TCG.AACTCGT 
AGGTGCTGCG 
GCACCGGACG 
GCGCGGTGAG 



GGGGGAGGGG 
GCGAGG7CGC 
GCGTGGAGGG 
GCGAGGA7CT 
7CGCCC7T77 
7GGCGTGAGA 
A-AAGAAGAGC 
GATCToG i T i 
ACGCCTGCGC 
GCGCCACGAG 
CC7TCCGGCC 
CCGGGTCG T C 
GGCCCTGCTG 
CGGCGGGGhC 
GCTGGGCGAC 
CGCCGCC7GG 
GCGCGAGCAA 
GCCGGTCGCC 
CGCGGTCGTC 
CTTCCAAGCG 
GGCGAACGGT 
GCTGCGCGGC 
CACGACCACG 
CGCACCGGAA 
GCGCGACGCG 
CTGGTTCACC 
CGGCGAACTG 



>^."j r\ ^ o o w ^ ^ 

CGGCATCC7C 
CCTCGACAAC 

G GAT AC 

GCACACCGAC 

ATACCCGGTA 

GTCACCGGAC 

CTCCAGCAA7 

GGTCTA77GG 

GCGTTGCGGA 

CCGGAACACC 

CGCGAGGAGA 

ATCCGCC7CG 

GGCTGGTCG7 

AC7GGCCGGG 

GAGCG GCGGG 

CTCC ; oG'jo _ j 

GAGGGGGACG 

ACGCTCGCGC 

GTCC7CGCGG 

ACGCGGGG 

GACCTCTCGG 

GACGCGTTCG 

CGCGACCTGT 

GCGACGGGCG 

CGCTTCGACC 

CTCTACAGCC 
4 9031 




GGTTGCTu'or'. 




G -j 0 4' '0 „ T 


TC 


49: 4 1 


GACGT^CGGC 


' " T r- ^ f ^ 


GCGGGCCGGC 


G .~ G Cj G 0 r-. G 




4 920 : 


7CGAACGACA 


^ - G VJT C G C G '.7 Gi". 


CC7GCCCG7C 


GACA0GC7 


GG 


4 92c^ 


— ^ r^.~- r--^ 


rC^'^CoGCG" 


GGTGGCCC^G 


AG 0GAGGC 






~ 4 p p ^; ^ p r- 










J ^ ^ - i 


GG0GACC7GG 


0 0GGGA7C7G 


G G0GGA7CGC 




'GG 


- • 4 T " 












4 9 5 0 ^ 


p.- - ----- - - 


~, ^ ^ ~ r. 




r*. — vj 'O 1 ^: ^ . 


;;~ 


4 9 5 6 1 


^^^^^ 


AT G 7 G 0 C G C A 


; . j r g t 0 — ■ — 0 


T 7GGG AG 0-2-. 




■1 9 c 2 1 




'o G o G A C ( o r 


G 0 AG 0 T G 0 2-. G 


GGGG2.G4-C: 




4 9 6 81 


'"*■> " ^ .' ^ *-p 1 " ^ 
^ O O o 1 - \J i" 1 - 


GCGGCAGGGG 


0 AA G ; c: C C G T 


C7CA7GCC 


0 0 


4 9 7 4 1 


C7GCTCTGGC 


A o G-AG C G G AG 


^ , ^ ^ ^ ^, „ ^, „ r . 


G u 




4 98 0 1 


^uOC^ AC G T 


T CG AC TAG TG 




* TCTTTTG 


G7 


4 9861 


>0 1 .—4 - ^ W W 'O ^ 


C G G AC G AG G T 


~, ^-4 qn -r. — r. ^-t 

v^^ljVj 4 1 v^ur.'v. 


G G G G C G G G 


AG 


■; 9 9 2 1 




CCCGGA7CTA 


CGCGC^-ACG 


GCCG7AC0 


GG 


4 9 95 1 










G T 




« m ^^rpp t r, 


G G C G G T T G G G 


0GAG01 GTGC 






-~ 0 .J. w . 




C G G C C G AA A G 




AGCGG GIGA 




5 0161 




l wV_ i n^V„b-w4 LJ 4^ w 


GA1 CGGCCCG 







r\G 'o c o G * G AG o o CGACCCG 
GGGGAGCGCG CGTGGTGCCC 
GGGGCCTGGT ( AC 
AGATCTGGGT GAGGTAGGCG 
.3 G'^CGGG'- -ji-o GAGGGCCAGG 



GC 

; AGGGC ACC 

:accgttcc 



^. o ^ .- j 4 i o ^ T -4. ■' ^-3 ^^oGCAoC^u 
G 1 G G C C T A G G C G 2 ; 4 T G T A C A C G 
GTGTGAGGGG CGTCAACCTG 
GGCGGACGGT CCAGTTCGTG 
CGCTGCTGGG CGGCACGCTC 
T G GCCCG G I o G2-4.T GG.hCG AA 
o'^vr^o^ i -oA - VwGA.G^4".Cv-3TC 
GCCAGGGCGG GG.AGGCGCTG 
ZCCkCZTGCd CGTGCACAAT 
ZGCTGZCZGZ C^ r -"CCCGAC 
ACACCCGCA7 GGATCTGC7C 
2022 1 GAGGAGGGGA TGGGGGCGGT 7GGGGAGGG7 A7GGGGGG 2 2 AGG7GTGGG7 CGCCGGCGTC 
50281 GGGCTGGGCC GTGGG7ACCT GGGCCG7CGC GAGC7GA0CG ZZGkGCGCZG GG7GCCGGGA 
50 34 1 GA7GGGG7GG GCGAGGAGCG CATGTACCTC AGGGGGGAGG 7GGGGGGGGG CGCGCGCGAG 
50401 GGCGACCTGG AATTCCTCGG CGGGA7CGAC GAGGAGGGGA AGATCGGGGG GA7CCGCGTG 
r-J4 61 GP-i4 £ \CCGGGTG AGA7CGAGAG CC7GC7CGCC GAGGAGA; J ! .^ v -^CGTGACGg.-. G'jCuGCGGTG 
50521 TCCGTGCGCG AGGAGCGGCG GGGCGAGAAG 77GC7GGCCG GGTACGTCGT AGCGGTGGCC 
5 0 d ? i GGGGGvjGAGG GCGAGGAGT7 GGGCGGGTGG G^oG^Gw^oo kjP'.G i o 'j ; wC'o w ^OuGu * bCOC 
5064 1 GCCGCGCTCG TGCCCZ'CCGC GG7CG7CCTG GTGGAGCGAG 7GCCGAGGAG CACGAGCGGC 
507 01 AA.GGTGGACC GGGGCGGGGT GCCCGACCCG GAGCCGGG GG CGGCGTCGAC GGGGGCGGTT 
5 0761 ACGCCCCGCA CCGATGCCGA GGGG.AGGG7G TGCCGGA7G7 TGGAGGAGG7 GGTCGACGTC 
506 21 CCGGGGGTCG GTGCCGACGA CGACTTCTTC ACGCTCGGCG GGCACTCCCT GGTCGCCACC 
5 08 SI CGGGTCGTCT GCCGC2-vTCCG CGCCGAGCTG GGTGCCGA7G 7CGCGGTGGG TACGCTCTTC 
5094 1 GACGGGCGGA CGCCCGCCGC GCTCGCCCGT GCGGCGGAGG AGGCCGGCCC GGCCGCCCTG 
5 10C I CCCCCGA7CG CGCCCTCCGC GGAGAACGGG CCGGCCCCCC TCACCGCGGC ACAGGAACAG 
510 61 ATGCTGCAGT CGC4ACGGC7C GC7GCTCGCC GGGCCG7GC7 AGACGGTCGC CCCGTACGGG 
51121 TTCCGGGTGC GCGGGCCACT CGACCGCGAA GCGC7GGACG GGGCACTGAC CCGGATCGCC 
5115 1 GGGCGCCACG AGCCGCTGCG GAGCGGG77C CGCGA7CGGG AJ\CAGGTCGT GCGGCCGCCC 
51241 GG7GCGG7GG GCGCCGAGGT GG77CGGGTG CCGG7CGGZG AGGTCGACGG GGCGGTCCGG 
51301 GTCGCCCACC GGGAGCTGAC CCGGCCGTTC GACCTGGTGA ACGGG7CG7T GCTGCGTGCC 
51361 GTGCTGCTGC CGCTGGGCGC CGAGGATCAC GTGCTGC7GC 7GATGGTGCA CCACCTCGCC 
51421 GG7GAGGGA7 GGTCC7TCGA CCTCCTGGTC CGGGAG77G7 GGGGGACGGA AGCGGACCTT 
514 81 CCGGTGTCCT ACACGGACGT GGGCGGGTGG GAAGGGAG7G CGGCGG7GA7 GGCGGCCAGG 
5154 1 GAGAACGACC GGGCCTACTG GCGCCGGCGG CTGGGGGGCG CGACGGCGGC GGAGCTGCCC 
51601 GCGGTCCGGC CCGGCGGGGC ACCGACCGGG CGGGCGT7GC TGTGGACGGT CAAGGACACC 
51661 GGCGTGCTGG CGGCkCGZCG GGTCGCGGAC GCCCACGACG CGACGTTGCA CG.AAACCGTG 
51721 CTCGGCGCCT TCGCCCTGG7 GGTGGCGGAG ACCGCCGACA GGGAGGAGG7 GCTCGTCGCG 
5178: ACGCCGTTCG CGGACCGGGG GTACGCCGGG AGCGAGCACG TCATCGGCTT C7TCGCGAAG 
5134 1 G7CCTCGCGC TGCGCCTCGA CCTCGGCGGC ACGCGG7CG7 7CCCCGAGG7 GCTGCGCCGG 
51901 GTGCACACCG CGATGGTGGG CGCGCACGCC CACCAGGCGG 7GCCCTAC7C CGCGCTGCGC 
51961 GCCGAGGACC CCGCGG7GCC GCCGGCCCCC GTGTCGTTCC AGCTCATCAG CGCGCTCAGC 
5202 1 GCGGAACTGC GGCTGCCCGG CATGGACACC GAGCCG7TCC CCGTCGTGGC CGAGACCGTC 
52081 GACGAGATGA CCGGCGAACT G7CGA7CAAC CTCTTCGACG ACGGTCGCAC CGTCTCCGGC 
52141 GCGGTGGTCC ACGATGCGGC GCTGCTCGAG CGTGCCACCG TCGACGATTT GCTCACCCGG 
52201 GTGGAGGCGA CGCTGCGTGC CGCCGCGGGC GAGCTCACGG TACGCGTCAC CGGTTACGTG 
522 61 G.AAAGCoAGT AGCCATGCCC GAGGAGGACA AGACAGTCGA GTACCTTCGC TGGGCGACCG 
52 321 CGGAACTCCA GAAGACCCGT GCGGAACTCG CCGCGCACAG CGAGCCGTTG GCGATCGTGG 
52 361 GGATGGCCTG CCGGCTGCCC GGGCGGGTGG CGTCGCCGGA GGACCTGTGG CAGTTGCTGG 
524 4 1 AG7CCGGTGG CGACGGCATC ACGGCG7TGC CCACGGACCG GGGCTGGGAG AGGAGCGGCG 
52501 AGGGTCGCGG CGGCTTCCTC ACCGGGGCGG CCGGCTTCGA CGCGGCGTTC TTCGGCATCA 
52 561 GCCCGGGGGA GGCGGTGGCG ATGGACCCGC AGCAGCGCCT GGCCCTGGAG ACCTCGTGGG 
52 621 AGGCG7TCGA GCACGCGGGC ATCGATCCGC AGACGCTGCG GGGCAGTGAC ACGGGGGTGT 
52681 TGCTCGGGGC GTTCTTCCAG GGGTACGGCA TCGGCGCCGA CTTCGACGG7 TACGGCACCA 

527 4 1 CGAGCATTCA CACGAGCGTG CTCTCCGGCC GCCTCGCGTA CTTCTACGGT CTGGAGGGTC 

528 01 CGGCGGTC-hC GGTCGACACG GGGTGTTCGT CGTCGCTGGT GGCGCTGCAG CAGGCCGGGC 
528 61 AGTCGCTGCG CTCCGGCG.AA 7GCTCGCTCG CCCTGGTCGG CGGCGTCACG GTGATGGCCT 
CCCCGAC GCGCGC^GCA 

3^3C1 AGGCCCTCGC GGAAGCGGGC Coo o.-vCC GCTTCGCCGA GGGGTCGGGG GTCCTGATCG 

5 3C4C TAG AG G\A. G G T 7 1 C C G A G G ' G o : - . - o ^ 1 G G G ^ A/C C G C G C GCTGGCGGTC GTGGGGGGTT 

5.3101 GCGCCGTCAA CCGAGGACCGC GG CT C G ~ovC A; GGCTGTCCGC GCCGAACGGG GGGTCGCAGG 

~ 3 i r , ^ '.j :G _ I ^ ^cu^m'oo.- - i - - o - oo ~- - '-A- G A v. vj;GCGor\C GTGGACGCCo 

3 J w ^ _ ^; v; ... ... ^ .;oo .**.-- o O . ,~ o o 1 - o -J o.~i. w ^- o- ~ .~. - ... ^irwjKj^f*^. r. o U a vjo. I Vjo 

3 J - ^ - r : ; J A-AAA. .-ACG J o A-vG AG GA" C; C G.-.CA.CCv „ . .■ 'I o i oC . o o ..: a _GCTGAr\G TCCAACATCG 

3 3 'j -\ i ^ 3 ... G „— . G : J C -. G '.j G v C w G o j - v. o -o . jl ^ : ". - « J.-*,. uoTCC . C G o C A T G C o G C 

3 - -i v _ .j'„GA ~CCC GGCGC.--. _ G .. ^ oo t -\._o - o -. r*.'^;:-. 1 ^ ...... ... „ . CGCACG ~ G GA.GTGGAGGG 

- j ■) o . . . ; j'j.a^ i .GT o oAAv^. TCoT ^ jv„.„ „v.o To j ■„ ^ o-~~"GA^ o GA- ^ CoCCCACGGC 

5 2b2 1 G GGGGGG TGT CTCCTCCTTC GGCGTCAGCG GCACCAACGC CCACATCATC CTCGAAAGCC 

r. j o 1 A.CCCCCGACC G .jCCGCCG.^A. GZZGZZGG^^ 0A.CC3 - ^ oooACCGCTG GGGGTGCTGG 

o.:o'l I TCCCGGCCCG GA.GGGGGGAG CAAACC Co-Co ..'G.A;;;.r..r. oCGCCTGCGC GCGTTCGTGG 

GGGAGGCAGT 

3- j' ■ 6. . ^ oAG : -...-''aG'o CGCCoTGCIo G. G'^'j^o.". - ~. ^^.i.. i ■^.-^ ; ^ . o.-iG^CGG /i/'.^oCCGGCC 

^ ^ oCoCACC^'ji 'oGTGTTGGTG A.-. 2T^>>-; -ro ^r-. ..: . . ^>_vaA.AC A.GCGoGCGGC 

' ~ i ~. C T CA;: AG AC GAGCTA.G GG G CT^T T v ^ ^; : ^ .j .. _ jn^bC^CTC GACCACCTCG 

_ -. ■„' . ^ G G G G G ^ .o „ .'\ G C A.C ^_ C C _ G G ^ ^ - - - — ^ j ^- _ ... . ... ... 1 GuoTG.^ j i A.CC o C o G 

; i I 1 o . A;^.".C'j^GG- .^uuT^ iA'C .G TC^vG'jr.j^o .—.^0-^.0.0-.?.....:.- j. ; w^TCnG ^ PiG^GoGA^v^C 

5 4 121 GCCCGA/CGGA GCAACTGCGG 7GGGGCG3GG GGG.GGGGWZ'S GGTCCTGAGG AGCGAGGAAA 

GGTGAACGGG CCCCACTCCC 

z> ^ 4 1 CGTGGGGGC CGGGGACG.-.G *j.'--A j .oGGoGA^ TLu^n'oo^ j _ .AoGCr.cCT ^ 'o'oCA.TCCACC 
'Sj -J j 0' _ A. Co, G G G C Go G GA.GCGGCGr-.G oC^uu^^.-.C I ... o o^-.o ^ o . j^noGor-.C . „ GTCoCCCCCG 
-•1361 GCCCCGACGC GGCCGGGAGG GTGAG""" ACC ACGAGGGCCA GACGGCCATC GCCGGCGACC 
3^-121 C CA.CCA.CCGA.' CGAAGAGGGG GGGCA.GC.AGG GGGGiGG-- ~ A--. .^GTACGTTTC GAGGCGCAGA 
5 4 -J G I CGGAGCAGGA GGGGGGGGC3 AGGGTGGTGG AGATCGGCCC GAACCAGGAG CTCTGGCCGC 
54 5-: 1 A CGTCGACGG GGTTGGCGGC CAGACGGGGA GGCCGGAGGA GGTGGGGGCG CTGGACACCG 
54 601 CGGTCGCGCA GGTCCACGTG CGGGGCGGCG CGATCGAGGG GACGCTCGTC GTCGGCGGGG 
54 661 AGCGCGGGCG GGTCACGGGG CCCACGTATG GGTTCCAGGA GAAGGACTAG TGGCTGGGGC 
54 721 CCACCGCCCG GGCCGATGGG ^.GCGGGGGGG GGCAGGAGCA GGGGGCGGAC CCGCTGCTCG 
54 761 GGGGCGCGGG CGCGCTGCCG GGCACGGGGG GAGTCG7CCC GACGGGGGGC CTGTCGCTGG 
54 84 1 CCTCCCATCC GTGGCTCGGC GAGCACGCGG TCGACGGCAG CGTGCTCCTG CCCGGCGCGG 

54 901 GCGTCCGCGA ACTCGCGGGG CGCGCCGGCG ACGAGGTGGG CTGCGACCTG CTGCACGAAC 
5 4 96] ZGG^GZ-.VGGk GACGCCGCTC GGGGTGGGGG GGACCGGCGG GGGGGGGGrG TCCGTCGAGA 
5 5021 GCGGGGAACG GGACGAGAGG GGGGCGCGGG CGGCCAGGGG CCACGCGCGG GCCGACGGCT 
5 50S1 CGGGCCTGTG G ACC GG AG AG GCCGGCGGAC GGGGGGGGhG GGCACCGGCA CCGGCCACGG 
3 3-41 GG.ACGGACCC GGCAGCCGGC CGGGGGGCGG A U AGCCG7GP*AA" GGTGGA.CGGTG GCCGACGGCT 
5 5 201 ACGA.CCGGTC CGAGGAxCAGC GGGT ACC CGG ACGGACCo^o GTTCGGGGGG CTvjCGG^CCG 
5 5261 CCCGGCGCGG GGGCGACACG GGGTACGGGG AGGTCGGCCG CCCCGACGAG CA.G^GGGCCG 
55321 AGGCCGCCCG CTTCACGCTG CACCCCGCGG TGCTCGAGGC CGCGTCCCAG GGGGGGGGGG 
5 5 331 TGGCCGCGCT CGACGCACCC GGGGGGGGGG CCCGAGTGCG GTCCTCGTCC CAGGACGTCC 
5544 1 GCATCCACGC GGGGGGGGGG ACGCGGCTGC GGGCCACGGG GGGCCGCGAC GGCGAGCGCA 

55 501 GGACCGGCGG CAT GACCGGC CCGGACGGGC AGCTCGGGGC CGTGGTCGGT GCCGTGCTGT 
55561 CGCGCCCGTA CGCGGAAGGC TCCGGTGACG GCCTGCTGCG CGCGGTCTGG ACCGAGCTGC 
5 5 621 CGATGCCCGT GCCGTCCGCG GACGATCCGC o ^ GT G GAG GG CCTCGGCGCC GACCCGGGCG 
5 5 681 ACG GCGACGT TCCGGCGGCC ACGCGGGAGC TGACCGCGGG CGTCCTCGGC GCGCTCCAGC 
5574 1 GCCACCTGTC GGGGGGGGGG GACACCACCC TGGTGGTACG GACCGGCACC GGCCCGGCCG 
5 5301 CTGCCGCCGC CGCGGGTCTG GTCCGCTCGG CGCAGGGGGA GAACCCCGGC CGCGTCGTGC 
5 5861 TCGTCGAGGC GTCCCCGGAC ACCTCGGTGG AGCTGCCCGC GGCGTGCGCC GCGCTGGACG 
5 5 921 AACCGCAGCT GGCCGTCCGG GACGGCGTGC TCTTCGCGCG GCGGCTGGTC CGGATGTCCG 
5 5 961 ACCCCGCGCA CGGCCCGCGG TCCCTGCCGG ACGGCGACTG GCTGCTCACC CGGTCCGCCT 
5604 1 CCGGCACGTT GCACGACGTC GCGCTCATAG CCGACGACAG CCCCCGGCGG GCGCTCGAAG 
5 6101 CCGGCGAGGT CCG CATC GAG GTCCGCGCGG CCGGAGGGAA CTTCCGCGAT GTGCTGATCG 
5 6161 CGGTCGGGAC GTACACCGGG GCCACGGCCA TGGGCGGCGA GGCCGCGGGC GTCGTGGTGG 
5 6 221 AGAGCGGGCC CGGCGTGGAC GACCTGTCCC CCGGCGACCG GGTGTTCGGC GTGhGCCGGG 
5 5281 GCGGCATCGG CCGGACGGGC GTCACCGACC GGCGCTGGCT GGCCCGGATC CCCGACGGCT 
5 6 341 GGAGCTTCAC CACGGCGGCG TCCGTCCCGA TCGTGTCCGC GACCGCGTGG TACGGCGTGG 
5 6401 TCGACCTCGG CACACTGCGG GCCGGCGAGA AGGTCCTCGT CCACGCGGCC ACCGGCGGTG 
5 6461 TCGGCATGGC CGCCGCACAG ATCGCCCGCC ACCTGGGCGC CGAGCTCTAC GCCACGGCCA 
5 6 521 GTACCGGCA7'. GCAGCACGTC CTGCGCGGCG CCGGGCTGCC CGACACGCAC ATCGCCGACT 
5 6581 CT CGG ACG AC CGCGTTCCGG ACCGCTTTCC CGCGCATGGA CGTCGTCCTG AACGCGCTGA 
5 6 641 CCGGCGAGTT CATCGACGCG TCGCTCGACC TGCTGGACGG CGACGGCCGG TTCGTCGAGA 
5 6701 TGGGCGGGhG CGAGCTGCGC GACCCGGGCG CGATCGTCCC CGCCTACCTG CCGTTCGACC 






WO 00/20601 



PCT/US99/22886 



be"? -51 7GCTGGACG C GGGCGCCGAC CG0ATCG3CG AGATCCTGGG CGAACTGCTC CGGGTGTTCG 

568:.': 1 ACGCGGGCGC GC7GGAGCCG C7GCC3GTCC G7GCCTGGGA CGTCCGGCAG GCAGGCGACG 

5683] C GC TCGGCTG G AT GAGGCGC 300CGCCACA 7CGGCAAGAA CGTCCCGACG CTGGCCCGGC 

5694 1 CGCTCGACCC GGAGGGGG GG GTCGTCCTCA CCGGCGGCTC GGGCAGGGTG GCCGGCATCC 

5 7 501 CCGCCCGCCA CGTGGGGGAA CGGCATGTC7 ACCTGCTGCC CGGGAGGGCA CCGCCCGAGG 

5^061 OGACGCCCOG CG7CCACCTG CCCTGCGACG 7CGGTGAC0G GGACCAGCTG GGGGCGGCCC 

v. ■ i ^ 1 GGAGCGGGC G5AGGGGGGG "irCrv~CGCC'~ T'j „;T — - _ GTGCG CTGGACGACG 

5^15: GGAGGGTCGG CTCGCTCACC 0070AGCGT7 TCGAGAGGG 7 GCTGGGGGGG AAGGCCGACG 

GGCCGCGTTC GTGCTCTACT 

5"' 301 CGTCGGGCGC CGGCGTGCTG* GGGAACGCGG GCCAGGGCAA CTACGTCGCG GGGAACGCGT 

57 361 7CC7CGACGC GGTCGCCGAG C7GGGCGAGG GTTCCGGGCT GCGGGGGCTG TCCATCGCCT 

57421 GGGGGCTCTG GGAGGACGTG AGCGGGCTCA CGGCGGCGGT GGGCGAAGCC GACCGGGACC 

574 61 GGATGCGGCG CAGCGGTTTC CGGGCC AT C A CCGCGCAACA GGGCATGCAC CTGTACGAGG 

57 54 1 CGGCCGGCCG CAGCGGAAGT CCCG7GGTGG TCGCGGCG3C GCTCGACGAC GCGCCGGACG 

57 601 TGCCGCTGCT GCGCGGCCTG CGGCGGACGA CGGTGGGGGG GGCCGCCGTC CGGGAGTGTT 

57 661 CGTCCGCCGA CCGGCTCGCC ZZGCZGkZGG GCGAGGAGCT GGCCGAAGCG CTGCTGACGC 

5 nn 2l TCGTCCGGGA GAGCACCGCC GCCGTGCTCG GCGACGTGGG TGGCGAGGAC ATCCCCGCGA 

5770 1 CGGCGGCCTT CP-AG GO 1CTC GG'JATCGACT CGCTCACCGC GGTCCAGCTG CGCAACGCCC 

57 8 4 ICACCGAGGC GAGCGGTGTG C03CTGAACG CCACG00GG7 CTTCGACTTC GCGACCCCGC 

57 901 AGGTGGTCGC CGGGAAGC7C 03CGA03AAC TGACGGGGAC CCGCGCGCCC GTCGTGCCCC 

57 961 GGACGGCGGC CACGGCCGGT GCGCACGACG AGCCGC73C3 GATCGTGGGA ATGGCC^GCC 
55021 GGCTGCCCGG CGGGGTCGCG TGACCCGAGG AG CT 3 TOG OA C3TCGTGGCA TCCGGCACCG 
5806 1 ACGCCATCAC GGAGTTCCCG AC G G AC CGCG GCTGCGACG7 CGACGCGATC TACGACCCGG 
5514 1 ACCCCGACGC GATCGGCAAG A3CTTCGTCC GGCAGGG7GG CTTCCTCACC GGCGCGACAG 
53 201 GCTTCGACGC GGCGTTCTTC GG2ATCAGCC CGCGCGAGGG CCTCGCGATG GACCCGCAGC 
5 8261 AGCGGGTGC7 CCTGGAGACG TCGTGGGAGG CGTTCGAAAG GGCCGGCA.TG ACCCCGGACT 
53 321 CGACCCGCGG CAGCGACACC GGCGTGTTCG TCGGCGCC7T CTCCTACGGT TACGGGACCG 
5 8 381 GTGCGGAGAC CGACGGCTTC GGCGCGACCG GCTCGGAGAG CAGTGTGCTC "TCGGGGGGGC 
5S44 1 TGTCGTACTT CTACGGTGTG GAGGGTCCGG CGGTCACGGT CGACACGGCG TGTTCGTCGT 
53501 CGCTGGTGGC GCTGCACCAG GCCGGGCAGT CGCTGCGCTC CGGCGAATGC TCGCTCGCCC 
58 5 61 7GGTCGGCGG CGTCACGGTG ATGGCGTCTC CCGGCGGCTT GGTGGAGTTC TCCCGGCAGC 
5 8 621 GCGGCCTCGC GGZGGAGGGZ CGGGCGAAGG CGTTCGGCGC GGGTGCGGAC GGCACGAGGT 
5 8 681 TCGCCGAGGG TGCCGGTGTG CTGATCGTCG AGAGGCTCTC CGACGCCGAA CGCAACGGTC 

587 4 1 ACACCGTCCT GGCGGTCGTG CGTGGTTCGG CGGTCAACCA GGATGGTGCC TCCAACGGGC 
583 01 TGTCGGCGCC GAACGGGCCG TCGCAGGAGC GGGTGATCCG GCAGGCCCTG GCCAACGCCG 

588 61 GGCTCACCCC GGCGGAGGTG GACGCCGTCG AGGGCCACGG CACGGGCACC hGGCTGGGCG 

58 92! ACCCCATCGA GGG AC AGGCG G7ACTGGCCA CCTACGGACA GGAGCGCGCC ACCCCCCTGC 
5 5 981 TGCTGGGCTG GGTGAAGTCG AACATCGGCC ACGCCGAGGC CGCGTCCGGC GTCGCCGGCA 
5904 1 TCATCAAGAT GGTGC AGGCG CTCCGGCACG GGGAGCTGCC GCCGACGCTG ChCGCCGkGG 
59101 AGCCGTCGCC GCACGTCGAG 7GGACGGCCG GCGCGGTGGA ACTGCTGACG TCGGCCCGGC 
5 9161 GGTGGCCCGA GAGCGACCGG CGAGGGCGTG CCGCCGTCTG GTCGTTCGGG GTGAGCGGCA 
5 9221 CCAACGCCCA CGTCATCCTG GAGGCCGGAC CGGTAACGGA GACGCCCGCG GCATCGCCTT 
5 9281 CCGGTGACCT TCCCCTGCTG GTG7CGGCAC GCTCACCGGA AGCGCTCGAC GAGCAGATCC 
5 934 1 GCCGACTGCG CGCCTACCTG GACACCACCC CGGACGTCGA CCGGGTGGCC GTGGCACAGA 
594 01 CGCTGGCCCG GCGCACACAC TTCGCCCACC GCGCCGTGCT GCTCGGTGAC AGCGTCATCA 
5 9461 CCACACCCCC CGCGGACCGG CCCGACGAAC TCGTCTTCGT CTACTCCGGC CAGGGCACCC 
5952 1 AGCATCCCGC GATGGGCGAG CAGCTCGCCG CCGCCCATCG CGTGTTCGCC GACGCCTGGC 
5958 1 ATGAAGCGCT CCGCCGCCTT GACAACCCCG ACCCCCACGA CCCCACGCAC AGCCAGCATG 
5964 1 TGCTCTTCGC CCACCAGGCG GCG7TCACCG CCCTCCTGCG GTCC'TGGGGG ATCACCCCGC 
5 97 CI ACGCGGTCAT CGGCCACTCG CTGGGCGAGA TCACCGCGGC GCACGCCGCC GGCATCCTGT 
5 9761 CGCTGGACGA CGGGTGCACC CTGA7CACCA CGCGCGCCCG CCTCATGCAC ACGCTCCCGC 
59821 CACCCGGTGC CATGGTGACC G7ACTGACCA GCGAAGAGAA GGCACGCCAG GCGTTGCGGC 
5 9881 CGGGCGTGGA GATCGCCGCC GTCAACGGGC CCCAC7CCAT CGTGCTGTCC GGGGACGAGG 
5994 1 ACGCCGTGCT CACCGTCGCC GGGCAGCTCG GCATCCACCA CCGCCTGCCC GCGCCGGACG 
6000 1 CCGGGCACTC CGCGCACATG GAGCCCGTGG CCGCCGAGCT GCTCGCCACC ACCCGCGGGC 
60061 TCCGCTACCA CCCTCCCCAC ACGTCCATTC CGAACGACCC CACCACCGCT GAGTACTGGG 
60121 CCGAGCAGGT CCGCAAGCCC GTGCTG77CC ACGCCCACGC GCAGCAGTAC CCGGACGCCG 
60181 TGTTCGTGGA GATCGGCCCC GCCCAGGACC TCTCCCCGCT CGTCGACGGG ATCCCGCTGC 
6024 1 AGAACGGCAC CGCGGAGGAG GTGCACGCGC TGCACACCGC GCTCGCGCAC CTCTACGCGC 
60301 GCGGTGCCAC GCTCGACTGG GCCGGCATCC TCGGGGC7GG GTCACGGCAC GACGCGGATG 
60361 TGCCCGCGTA CGCGTTCCAA CGGCGGCACT ACTGGATCGA GTCGGCACGC CCGGCCGCAT 
604 21 CCGACGCGGG CCACCCCGTG CTGGGCTCCG GTATCGCCCT CGCCGGGTCG CCGGGCCGGG 
604 81 TGTTCACGGG TTCCGTGCCG ACCGGTGCGG ACCGCGCGGT GTTCGTCGCC GAGCTGGCGC 
60541 TGGCCGCCGC GGACGCGGTC GACTGCGCCA CGGTCGAGCG GCTCGACATC GCCTCCGTGC 
6S28I GGG^GGCGCZ CACCGCCCAC CTGCTCGAC Z TGATCGACGC TCCCACCGCC CGGATCGCCG 

08 34 1 G3GAGTCCCG GCCCGCGGCG - ^ - G GG OCC CCGT71G0GGC CGCGGGGGAC CAGGACGAGC 

<o;;40jL GGATCGCCAG7' GGT GG; GG AG 'J - -G .-CGGG7 ToCC .„GG7GG T^TGACGTCG CCCGAGGACC 

c B 4 o . 7 ?: iGGGCGGC - GGGGGAGGGG -;-;-^. t4 .^ .., ^ACG 7." 7 C.-vCCAC GGGTGGTGAC GACCGCGGCT 

o^GGl GGGACG7CGA CGCGC7GTAG GAG GG J GAG 7" CGGGAG- GG GG G GGGGAAGGGG 7 AC AAC C T G C 

~ F : 1 : J -Tw'j w ^ ,>G t ^;\: -oA : ■• r - v - .*"•.'_) . ' ^ - J * - » ^'o:vC ATCAiGTCC^oC 

OSCi i '.J'^'or^-.c'v J „ - v,/-» i 1 ^ ^;.-V~ '^.'u".,. .. 7 .j .. - w ^ „- 'Jtr^J*^ r*.^ , ( oo ^ j TG>ovj A.GGCG A 

6S^1 .G ^AGJ'jCGu CCGGAG'CA.G T CC^GCGTCGC 7 w 7o., ; j777 GoAGGTCGGC GTGTATGTCG 

be <' j G'GGGGCG v., GCAGGGGGAG GG^CTGGGCG GG \jr-. J .~.C C^-^GoGCCAC GGGATCACCG 

6 G 3 2 i GGGG77CCAC GAGCCTGCCG TCCGGACGGG 7GGCGGA7GGC GCTCGGGC7G GAGGGCCCGG 

6 S B G 1 GG G TGAGGGT GGACACGGCG T3C7CGTCGT GGGTGGGGGG GGTGGATGTG GGGTGCCAGG 

c ti ^ -j a. '..i ■oCI oCGCCT i oG-_;CG < .".GGGC GA.ACGCGCTC .G37 >- G'^TC - lGGTA. CTGAGTTCGC 

6 9 06 1 : CGGCCGCG7T CGGGGAGTTC 7CCCGCC.AGC GCGGGC7CGG GGCCGACGGG CGC7GCAAGT 

6 9 Jc : G .j'T'Gv^Ljv^Co 1 ^ GoGCuCG'oj-vC v.-vjuACGrvCG^ ^Gl ... . ^'-.h-vv'o^oT^ C-.CGiA.CTGG 

c > _ 2 - vCGG^TCGC '^GACGCCur-.G l^GC - CvjoGC A.C.*~.G ^- ~ - . G 1 Go^CGTCGTC CoCooCAGCG 

6 913 7 CCGTCA.CGTG CGACGGCGCC CCCAACGGCC GGAGGG^GGG GAAG7GGGCTC 7CGCAGCAGC 

c9^'! : ^GT CATCCo GAAGGCGCTC GCCGCGGCCG i oo CT^.- G G-oG CGCCGA.CGTG GACGTCGTCG 

6 9 J 0 j. A.GGGGCACGG CACCGGCACC CGGCTCGGCG .-. GCCv- ,7 7 2GA. GGGGGAGGGG CTGCTCGCGA. 

o9GtI CGTACGGGCA GGACCGTCCG GCACCGGTC7 G77CG > oG jGTC GCTGAG-.G7CG AACA7CGGAC 

o '-J /. i .".TG^- CA.CGO'^ ^\jCGG\_CG ~ vjTvo : -v;c^^'o ±-^/\. Go/oGAa^o A.7CGGCGCGG 

6 9 4c 7 G C A C G A T G G C G C C G AG G G G G G AT G T 3 G .AG G AG C G G G G G G G C G C G G T C G A.C T G G AG C AC C G 

c 9 5 '3 . G A. G A G G T G G C G C T G C T C G G G G 1 G G Aa G G ^ G G 77 ? ..j j G ~ ,.- v.: o Gv w uAuCo? GGGTGGGGGG 

6 96 'j i CGGCCGTCTC GGCGTTCGGG G r ?CP\GCGGG:\ G^r-^\ J : .;G-. CGTCA.TCGrG GA-ACAGCACG 

6 9 6 C ■ -l v.: i G CG jCGC^. G - G ' J G G G G '«.. ^r/j^l.'jC^G- G — ^ - ; . - — : 1 o o 'o * ^ ^ ^ r\ o G ^ G T o G 

69721 GGTGGGTGGT CTCGGGGCGG A.CTCCGGCCG GGGTGGGGGG CZP^GGZGGZZ G3GC7GZGGG 

69GS1 .ACCACCTCGC GGCGGGACGG GACGGGGATG CGTTGGAGAG GGGGGAGGGG CTGGCGACCA 

693 41 GCCGCGCCCA GTTCGCCCAC CGTGCCGCGG TGGTGGGGAG CACCCCGGAC GGA7TCCGTG 

699G1 CCGCGC7CGA CGGCC7CGCG GACGGGGGGG AGGGGGGGGG AGTCGTGAGG GGGACCGCTC 

69961 AGGAGCGGCG CGTCGCC7TC GTGTTCGAGG GCGkGGGZGZ CCAGCGCGCG GGAATGGGGC 

"7C021 GCGAGCTCCA CCGCCGG7TC CCGGTGTTGG CCGCCGCGTG GGACGAGG7G rCGGACGCGT 

7 COS. TGGGCAAGGA CCTCAAGCAC TCCCCCACGG AGG7GTAGGA GGGCGA.AGAG GGGGGTGTCG 
7014 1 CCCATGACAC CCTG7ACGCC GAGGCCGGGC TG77GAGGGT CGAAGTGGCG CTGCTGCGGC 
7 02C1 7GC7GGAGCA CTGGGGGG7G CGGCCGGACG TGCTGGTCGG GCACTGCGTC GGCGAGGTGA 
7 0261 CGGCGGCGTA CGCGGCGGGG G7GCTCACCC TGGGGGAGGG GACGGAGTTG ATCGTGGCCC 
"0321 GGGGGCGGGC GCTGCGGGGG CTGCCGCCCG GGGGGAGGGG CGCGGGGGAC GGAAGCCCGG 
7 0 361 CGGAGGTCGG CGCCCGCAGG GATCGGGACA TCGCCGCGGG CAACGGCCGG TCCGCCGTGG 
704 4 1 TGCTCGCCGG TTCGCCGGAC GATG7GGCGG CGTTCGA.ACG GGAG7GGGGG GGGGGCGGGC 
7 0501 GGCGCACGAA ACGGC7CGAC GZGGGGGkZG CGTTCCAC7C GGGGCAGGTC G^CGGTGCGC 
7056 7 TCGACGGCTT GGGTACGGTG CTGGAGTCGC 7CGCG77CGG CGCGGCGGGG CTGCCGGTGG 

A7 c G , G C:CGCP\CG?'/.2 G/~,CGGGCCG j.nG'^GG G — oG ^ — .''j^Lou^Co^o l-.^v. . Cu^TuC 

/ G c 3 7 GGGAGGGGCG 7CGGCCGGTG CTGTTC7CGG AGIGGGGGGGG oGAoGTGGGC G^GCGCGGGG 
7074 1 TGACCACGT7 GGTGGCGGGG GGCCCCTCCG GC7CGGTGGG G7CGGCCGCG GGGGAGAGCG 
7 0801 CCGGGGAGGA CGCCGGGACC T AC CACGCGG TGGTGGGGGG CCGGACCGGT GAGGAGACCG 
70361 GGGCGGTGAC CGCCCTCGCC GAGCTGGAGG CCCACGG C3T CCCGGTCGAC CTGGCCGCGG 
7 0 921 7 AC: GGCCGG 7GGCCGGGCA GTGGACC77C CCGTGTAGGG GTTCCAGCAC CG7TCCTACT 
70981 GGGTGGCCCC GGCCGTGGCG GGGGCGCCGG CCAGGGTGGG GGACACCGGG GGTCCGGCGG 
71041 AG7CCGAGCC GGAGGACGTC ACCGTCGCCG AGATCG7CCG TCGGCGCACC GCGGCGCTGC 
7 1101 TCGGCGTCAC GGACCCCGGC GACGGGGATG CGGAAGCGAC GTTCTTCGCG CTCGGTTTCG 
71161 AGTCACTGGC GGGGGAGGGG CTGCGCAAC r T ^CTGGGGTG GGCAACCGGG CTGGACCTGC 
7122. CGGGGGCCGT CGGGTTCGAG GAGGAGACCG CGGGGGGGGG CACCGCGTTC CTCCAGGACC 
71231 GGATGGAGGC CGGGCAGGAG CGGATCGAGG CCGGGGAGGA CGACGACGGG CCCACCGTGC 
7 134 1 TCTCGCTCCT GGAGGAGAGG GAGTCGC7CG ACGCCGCGGA CATCGCGGCG ACGCCGGCCC 
714 01 CGGAGGGTGC GGCCATCGCC GATCTGCTCG ACAAGCTCGC CCATACCTGG AAGGACTACC 
714 61 GATGAGCACC GATACGCACG AGGGAACGCC GCCCGCCGGC CGCTGCCCAT TCGCGATCCA 
71521 GGACGGTCAC CGCGCCATCC TGGAGAGGGG CACGGTGGGT TCGTTCGACC TGTTCGGCGT 
71581 CAAGCACTGG CTGGTCGCCG CCGCCGAGGA CGTCAAGC7G GTCACCAACG ATCCGCGGTT 
71641 CAGGTCGGGC GCGCCGTCCG AGATGC7GGC CGAGCGGGGG CCCGGCTGGT TCTCCGGGAT 
71701 GGACTCACCG GAGCACAACC GCTACCGGCA GAAGATCGGG GGGGACTTCA CACTGCGCGC 
7 17 67 GGGGCGCAAG CGGGAGGACT TCGTCGCCGA GGCCGCCGAC GGCTGCCTGG ACGACATCGA 
71321 GGCCGCGGGA CCCGGChCCG ACCTCATCCC CGGGTACGGC AAGCGGCTGC CCTCCCTCGT 
7 1331 CATCAACGCG CTGTACGGGG TCACCCCTGA GGAGGGGGGG GTGCTGGAGG CACGGATGCG 
71941 CGACATCACC GGCTCGGCCG ATCTGGAGAG CGTCAAGACG CTGACCGACG ACTTCTTCGG 
72001 GCACGCGGTG CGGCTGGTCC GCGCGAAGCG TGACGAGCGG GGCGAGGACC TGCTGCACCG 
7 2061 GCTGGCCTGG GCCGACGACG GCGAGA7CTC GCTCAGCGAC GACGAGGCGA CGGGCGTGTT 
ACTGCCTCTA 
AGCTGGTCGA 
TACCGGGCGT 
ACG7GATCCC 
CG77CGATGT 
G7CCCGGCCA 
GTTTCCCGGA 
GGCCGGCCGA 
G7TGCCGAAC 
GGAGATC7TC 
GGTGTTCGAC 
7GG7GTGACC 
GGTGACGCGG 
CACCCGGAAG 
GGCGGCCCGG 
CG7CG.ACACC 
CCGCC4AGGTC 
CTGGGTGCCG 
TTGCCCACGG 
CGGCTCGGCG 
CGCGGCCTCG 
GCGGAACCCC 
GGGGT GGGGG 
GCGGTGCCGG 
ACGCGGCGGA 
G7GCGCCCGG 
CTCGCCCGGC 
GAGGAGCCGT 
ATCGCAGAGC 
GTCGGAGGCC 
G7GGTGCTCG 
GGTACCCATC 
CAGCGCCAGC 
GCGGGCCGCG 
GTGGGCCTGC 
GACACTCTCG 
AT AG AG CG AC 
TGAGGTCCGT 
GCCGCCGGTC 
CGGGCGCAGG 
GTGGCGCAGA 
CTCGGTCAGC 
GCGCAGCGAC 
CGCGGCGAGC 
CGGCAGGTTT 
GGGCTGGTCG 
Those of skill in the art will recognize that, due to the degenerate nature of the 
genetic code, a variety of DNA compounds differing in their nucleotide sequences can be 
used to encode a given amino acid sequence of the invention. The native DNA sequence 
encoding the FK-520 PKS of Streptomyces hygroscopicus is shown herein merely to 
illustrate a preferred embodiment of the invention, and the present invention includes 
DNA compounds of any sequence that encode the amino acid sequences of the 
polypeptides and proteins of the invention. In similar fashion, a polypeptide can typically 
tolerate one or more amino acid substitutions, deletions, and insertions in its amino acid 
sequence without loss or significant loss of a desired activity. The present invention 
includes such polypeptides with alternate amino acid sequences, and the amino acid 
sequences shown merely illustrate preferred embodiments of the invention. 
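
The degeneracy described above can be illustrated with a minimal sketch. It assumes the standard genetic code, and the short peptide and the two DNA sequences are hypothetical illustrations, not sequences taken from the FK-520 PKS genes; the function names are likewise introduced here only for illustration.

```python
from collections import Counter
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Number of codons that encode each amino acid (the degeneracy of each residue).
DEGENERACY = Counter(CODON_TABLE.values())


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, reading complete codons only."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_encodings(peptide: str) -> int:
    """Count the distinct DNA sequences that encode the given peptide."""
    total = 1
    for aa in peptide:
        total *= DEGENERACY[aa]
    return total


# Two hypothetical DNA sequences that differ at three positions
# yet encode the same four-residue peptide.
seq_a = "ATGGCCGGCAGC"
seq_b = "ATGGCTGGTTCT"
print(translate(seq_a), translate(seq_b))
print(count_encodings(translate(seq_a)))
```

Because the number of synonymous encodings multiplies across residues, even a short peptide has many possible coding sequences, which is why the native sequence is treated as only one embodiment among many.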

The recombinant nucleic acids, proteins, and peptides of the invention are many 
and diverse. To facilitate an understanding of the invention and the diverse compounds 
and methods provided thereby, the following general description of the FK-520 PKS 
genes and modules of the PKS proteins encoded thereby is provided. This general 
description is followed by a more detailed description of the various domains and 
modules of the FK-520 PKS contained in and encoded by the compounds of the 
invention. In this description, reference to a heterologous PKS refers to any PKS other 
than the FK-520 PKS. Unless otherwise indicated, reference to a PKS includes reference 
to a portion of a PKS. Moreover, reference to a domain, module, or PKS includes 
reference to the nucleic acids encoding the same and vice versa, because the methods 
and reagents of the invention provide or enable one to prepare proteins and the nucleic 
acids that encode them. 
The FK-520 PKS is composed of three proteins encoded by three genes 
designated fkbA, fkbB, and fkbC. The fkbA ORF encodes extender modules 7 - 10 of the 
PKS. The fkbB ORF encodes the loading module (the CoA ligase) and extender modules 
1 - 4 of the PKS. The fkbC ORF encodes extender modules 5 - 6 of the PKS. The fkbP 
ORF encodes the NRPS that attaches the pipecolic acid and cyclizes the FK-520 
polyketide. 
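
The assignment of modules to ORFs stated above can be captured in a small lookup structure. This is only a sketch: the dictionary layout and the helper names are hypothetical, and the ordering function assumes that the ORFs act in the order of the extender modules they encode.

```python
# Module assignments as stated in the text; the layout is illustrative only.
FK520_PKS_ORFS = {
    "fkbA": {"loading": False, "extenders": range(7, 11)},  # modules 7 - 10
    "fkbB": {"loading": True, "extenders": range(1, 5)},    # loading + modules 1 - 4
    "fkbC": {"loading": False, "extenders": range(5, 7)},   # modules 5 - 6
}


def orf_for_extender(module_number: int) -> str:
    """Return the ORF that encodes the given extender module."""
    for orf, content in FK520_PKS_ORFS.items():
        if module_number in content["extenders"]:
            return orf
    raise ValueError(f"no ORF encodes extender module {module_number}")


def orfs_in_module_order() -> list[str]:
    """List the ORFs sorted by the first extender module each one encodes."""
    return sorted(
        FK520_PKS_ORFS,
        key=lambda orf: FK520_PKS_ORFS[orf]["extenders"].start,
    )


print(orf_for_extender(6))
print(orfs_in_module_order())
```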

The loading module of the FK-520 PKS includes a CoA ligase, an ER domain, 
and an ACP domain. The starter building block or unit for FK-520 is believed to be a 
dihydroxycyclohexene carboxylic acid, which is derived from shikimate. The 
recombinant DNA compounds of the invention that encode the loading module of the 
FK-520 PKS and the corresponding polypeptides encoded thereby are useful for a 
variety of methods and in a variety of compounds. In one embodiment, a DNA 
compound comprising a sequence that encodes the FK-520 loading module is inserted 
into a DNA compound that comprises the coding sequence for a heterologous PKS. The 
resulting construct, in which the coding sequence for the loading module of the 
heterologous PKS is replaced by the coding sequence for the FK-520 loading module, 
provides a novel PKS coding sequence. Examples of heterologous PKS coding 
sequences include the rapamycin, FK-506, rifamycin, and avermectin PKS coding 
sequences. In another embodiment, a DNA compound comprising a sequence that 
encodes the FK-520 loading module is inserted into a DNA compound that comprises the 
coding sequence for the FK-520 PKS or a recombinant FK-520 PKS that produces an 
FK-520 derivative. 

In another embodiment, a portion of the loading module coding sequence is 
utilized in conjunction with a heterologous coding sequence. In this embodiment, the 
invention provides, for example, either replacing the CoA ligase with a different CoA
ligase, deleting the ER, or replacing the ER with a different ER. In addition, or
alternatively, the ACP can be replaced by another ACP. In similar fashion, the 
corresponding domains in another loading or extender module can be replaced by one or 
more domains of the FK-520 PKS. The resulting heterologous loading module coding 
sequence can be utilized in conjunction with a coding sequence for a PKS that
synthesizes FK-520, an FK-520 derivative, or another polyketide.

The first extender module of the FK-520 PKS includes a KS domain, an AT
domain specific for methylmalonyl CoA, a DH domain, a KR domain, and an ACP
domain. The recombinant DNA compounds of the invention that encode the first
extender module of the FK-520 PKS and the corresponding polypeptides encoded
thereby are useful for a variety of applications. In one embodiment, a DNA compound
comprising a sequence that encodes the FK-520 first extender module is inserted into a 
DNA compound that comprises the coding sequence for a heterologous PKS. The 
resulting construct, in which the coding sequence for a module of the heterologous PKS 
is either replaced by that for the first extender module of the FK-520 PKS or the latter is
merely added to coding sequences for modules of the heterologous PKS, provides a
novel PKS coding sequence. In another embodiment, a DNA compound comprising a 
sequence that encodes the first extender module of the FK-520 PKS is inserted into a 
DNA compound that comprises the remainder of the coding sequence for the FK-520 
PKS or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, all or only a portion of the first extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid 
module. In this embodiment, the invention provides, for example, either replacing the 
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting either the DH or KR or both; replacing the
DH or KR or both with another DH or KR; and/or inserting an ER. In replacing or
inserting KR, DH, and ER domains, it is often beneficial to replace the existing KR, DH, 
and ER domains with the complete set of domains desired from another module. Thus, if 
one desires to insert an ER domain, one may simply replace the existing KR and DH 
domains with a KR, DH, and ER set of domains from a module containing such
domains. In addition, the KS and/or ACP can be replaced with another KS and/or ACP.
In each of these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or 
ACP coding sequence can originate from a coding sequence for another module of the
FK-520 PKS, from a gene for a PKS that produces a polyketide other than FK-520, or 
from chemical synthesis. The resulting heterologous first extender module coding
sequence can be utilized in conjunction with a coding sequence for a PKS that
synthesizes FK-520, an FK-520 derivative, or another polyketide. In similar fashion, the 
corresponding domains in a module of a heterologous PKS can be replaced by one or 
more domains of the first extender module of the FK-520 PKS. 
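
The domain replacements described above can be pictured as operations on a simple record of a module's domains. The following is a minimal sketch under stated assumptions: the Module class and the functions swap_at and swap_reductive_set are hypothetical names, the donor module is invented for illustration, and the model tracks only domain composition, not sequence, linker regions, or enzymatic activity.

```python
from dataclasses import dataclass, replace
from typing import Tuple

# Extender-unit specificities named in the text for AT domains.
ALLOWED_SUBSTRATES = {"malonyl", "methylmalonyl", "ethylmalonyl", "2-hydroxymalonyl"}


@dataclass(frozen=True)
class Module:
    """Hypothetical record of one extender module's domain composition."""
    name: str
    at_substrate: str            # CoA extender unit selected by the AT domain
    reductive: Tuple[str, ...]   # any of "KR", "DH", "ER", in the order listed

    def domains(self) -> Tuple[str, ...]:
        return ("KS", f"AT({self.at_substrate})", *self.reductive, "ACP")


def swap_at(module: Module, substrate: str) -> Module:
    """Replace the AT domain with one specific for a different extender unit."""
    if substrate not in ALLOWED_SUBSTRATES:
        raise ValueError(f"unknown AT specificity: {substrate}")
    return replace(module, at_substrate=substrate)


def swap_reductive_set(module: Module, donor: Module) -> Module:
    """Replace the whole KR/DH/ER set with the complete set from a donor module."""
    return replace(module, reductive=donor.reductive)


# First extender module as described in the text: KS, AT (methylmalonyl), DH, KR, ACP.
fk520_m1 = Module("FK-520 extender module 1", "methylmalonyl", ("DH", "KR"))
# Hypothetical donor module carrying a complete KR, DH, and ER set.
donor = Module("hypothetical donor module", "malonyl", ("KR", "DH", "ER"))

hybrid = swap_reductive_set(swap_at(fk520_m1, "malonyl"), donor)
print(hybrid.domains())  # ('KS', 'AT(malonyl)', 'KR', 'DH', 'ER', 'ACP')
```

The sketch swaps the reductive domains as a complete set rather than inserting an ER alone, mirroring the suggestion in the text that an ER is most simply introduced by replacing the existing KR and DH with a KR, DH, and ER set from a module containing such domains.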

In an illustrative embodiment of this aspect of the invention, the invention
provides recombinant PKSs and recombinant DNA compounds and vectors that encode
such PKSs in which the KS domain of the first extender module has been inactivated.
Such constructs are especially useful when placed in translational reading frame with the 
remaining modules and domains of an FK-520 or FK-520 derivative PKS. The utility of 
these constructs is that host cells expressing, or cell free extracts containing, the PKS 
encoded thereby can be fed or supplied with N-acylcysteamine thioesters of novel
precursor molecules to prepare FK-520 derivatives. See U.S. patent application Serial
No. 60/117,384, filed 27 Jan. 1999, and PCT patent publication Nos. US97/02358 and
US99/03986, each of which is incorporated herein by reference. 

The second extender module of the FK-520 PKS includes a KS, an AT specific
for methylmalonyl CoA, a KR, an inactive DH, and an ACP. The recombinant DNA
compounds of the invention that encode the second extender module of the FK-520 PKS 
and the corresponding polypeptides encoded thereby are useful for a variety of 
applications. In one embodiment, a DNA compound comprising a sequence that encodes 
the FK-520 second extender module is inserted into a DNA compound that comprises the
coding sequence for a heterologous PKS. The resulting construct, in which the coding
sequence for a module of the heterologous PKS is either replaced by that for the second 
extender module of the FK-520 PKS or the latter is merely added to coding sequences 
for the modules of the heterologous PKS, provides a novel PKS coding sequence. In 
another embodiment, a DNA compound comprising a sequence that encodes the second
extender module of the FK-520 PKS is inserted into a DNA compound that comprises
the coding sequence for the remainder of the FK-520 PKS or a recombinant FK-520 PKS 
that produces an FK-520 derivative. 

In another embodiment, all or a portion of the second extender module coding 
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid
module. In this embodiment, the invention provides, for example, either replacing the
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or 2- 
hydroxymalonyl CoA specific AT; deleting the KR and/or the inactive DH; replacing the 
KR with another KR; and/or inserting an active DH or an active DH and an ER. In 
addition, the KS and/or ACP can be replaced with another KS and/or ACP. In each of
these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding
sequence can originate from a coding sequence for another module of the FK-520 PKS, 
from a coding sequence for a PKS that produces a polyketide other than FK-520, or from 
chemical synthesis. The resulting heterologous second extender module coding sequence 
can be utilized in conjunction with a coding sequence from a PKS that synthesizes
FK-520, an FK-520 derivative, or another polyketide. In similar fashion, the corresponding
domains in a module of a heterologous PKS can be replaced by one or more domains of
the second extender module of the FK-520 PKS. 

The third extender module of the FK-520 PKS includes a KS, an AT specific for 
malonyl CoA, a KR, an inactive DH, and an ACP. The recombinant DNA compounds of
the invention that encode the third extender module of the FK-520 PKS and the
corresponding polypeptides encoded thereby are useful for a variety of applications. In 
one embodiment, a DNA compound comprising a sequence that encodes the FK-520 
third extender module is inserted into a DNA compound that comprises the coding 
sequence for a heterologous PKS. The resulting construct, in which the coding sequence
for a module of the heterologous PKS is either replaced by that for the third extender
module of the FK-520 PKS or the latter is merely added to coding sequences for the 
modules of the heterologous PKS, provides a novel PKS coding sequence. In another 
embodiment, a DNA compound comprising a sequence that encodes the third extender 
module of the FK-520 PKS is inserted into a DNA compound that comprises the coding
sequence for the remainder of the FK-520 PKS or a recombinant FK-520 PKS that
produces an FK-520 derivative. 

In another embodiment, all or a portion of the third extender module coding 
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid 
module. In this embodiment, the invention provides, for example, either replacing the
malonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting the KR and/or the inactive DH; replacing the
KR with another KR; and/or inserting an active DH or an active DH and an ER. In 
addition, the KS and/or ACP can be replaced with another KS and/or ACP. In each of 
these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding
sequence can originate from a coding sequence for another module of the FK-520 PKS,
from a coding sequence for a PKS that produces a polyketide other than FK-520, or from 
chemical synthesis. The resulting heterologous third extender module coding sequence 
can be utilized in conjunction with a coding sequence from a PKS that synthesizes
FK-520, an FK-520 derivative, or another polyketide. In similar fashion, the corresponding
domains in a module of a heterologous PKS can be replaced by one or more domains of
the third extender module of the FK-520 PKS. 

The fourth extender module of the FK-520 PKS includes a KS, an AT that binds 
ethylmalonyl CoA, an inactive DH, and an ACP. The recombinant DNA compounds of 
the invention that encode the fourth extender module of the FK-520 PKS and the
corresponding polypeptides encoded thereby are useful for a variety of applications. In
one embodiment, a DNA compound comprising a sequence that encodes the FK-520
fourth extender module is inserted into a DNA compound that comprises the coding 
sequence for a heterologous PKS. The resulting construct, in which the coding sequence 
for a module of the heterologous PKS is either replaced by that for the fourth extender 
module of the FK-520 PKS or the latter is merely added to coding sequences for the 
modules of the heterologous PKS, provides a novel PKS coding sequence. In another 
embodiment, a DNA compound comprising a sequence that encodes the fourth extender 
module of the FK-520 PKS is inserted into a DNA compound that comprises the 
remainder of the coding sequence for the FK-520 PKS or a recombinant FK-520 PKS 
that produces an FK-520 derivative. 

In another embodiment, a portion of the fourth extender module coding sequence
is utilized in conjunction with other PKS coding sequences to create a hybrid module. In
this embodiment, the invention provides, for example, either replacing the ethylmalonyl
CoA specific AT with a malonyl CoA, methylmalonyl CoA, or 2-hydroxymalonyl CoA
specific AT; and/or deleting the inactive DH, inserting a KR, a KR and an active DH, or
a KR, an active DH, and an ER. In addition, the KS and/or ACP can be replaced with
another KS and/or ACP. In each of these replacements or insertions, the heterologous 
KS, AT, DH, KR, ER, or ACP coding sequence can originate from a coding sequence for 
another module of the FK-520 PKS, a PKS for a polyketide other than FK-520, or from 
chemical synthesis. The resulting heterologous fourth extender module coding sequence 
can be utilized in conjunction with a coding sequence for a PKS that synthesizes FK-520,
an FK-520 derivative, or another polyketide. In similar fashion, the corresponding 
domains in a module of a heterologous PKS can be replaced by one or more domains of 
the fourth extender module of the FK-520 PKS. 

As illustrative examples, the present invention provides recombinant genes, 
vectors, and host cells that result from the conversion of the FK-506 PKS to an FK-520 
PKS and vice-versa. In one embodiment, the invention provides a recombinant set of 
FK-506 PKS genes but in which the coding sequences for the fourth extender module or 
at least those for the AT domain in the fourth extender module have been replaced by 
those for the AT domain of the fourth extender module of the FK-520 PKS. This 
recombinant PKS can be used to produce FK-520 in recombinant host cells. In another 
embodiment, the invention provides a recombinant set of FK-520 PKS genes but in 
which the coding sequences for the fourth extender module or at least those for the AT 
domain in the fourth extender module have been replaced by those for the AT domain of
the fourth extender module of the FK-506 PKS. This recombinant PKS can be used to
produce FK-506 in recombinant host cells. 

Other examples of hybrid PKS enzymes of the invention include those in which 
the AT domain of module 4 has been replaced with a malonyl specific AT domain to 
provide a PKS that produces 21-desethyl-FK520 or with a methylmalonyl specific AT
domain to provide a PKS that produces 21-desethyl-21-methyl-FK520. Another hybrid
PKS of the invention is prepared by replacing the AT and inactive KR domain of FK-520
extender module 4 with a methylmalonyl specific AT and an active KR domain, such as,
for example, from module 2 of the DEBS or oleandolide PKS enzymes, to produce
21-desethyl-21-methyl-22-desoxo-22-hydroxy-FK520. The compounds produced by these
hybrid PKS enzymes are neurotrophins. 

The fifth extender module of the FK-520 PKS includes a KS, an AT that binds 
methylmalonyl CoA, a DH, a KR, and an ACP. The recombinant DNA compounds of 
the invention that encode the fifth extender module of the FK-520 PKS and the
corresponding polypeptides encoded thereby are useful for a variety of applications. In
one embodiment, a DNA compound comprising a sequence that encodes the FK-520 
fifth extender module is inserted into a DNA compound that comprises the coding 
sequence for a heterologous PKS. The resulting construct, in which the coding sequence 
for a module of the heterologous PKS is either replaced by that for the fifth extender
module of the FK-520 PKS or the latter is merely added to coding sequences for the
modules of the heterologous PKS, provides a novel PKS. In another embodiment, a 
DNA compound comprising a sequence that encodes the fifth extender module of the 
FK-520 PKS is inserted into a DNA compound that comprises the coding sequence for 
the FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion of the fifth extender module coding sequence is
utilized in conjunction with other PKS coding sequences to create a hybrid module. In 
this embodiment, the invention provides, for example, either replacing the 
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting any one or both of the DH and KR; replacing
any one or both of the DH and KR with another KR and/or DH; and/or inserting an ER.
In addition, the KS and/or ACP can be replaced with another KS and/or ACP. In each of 
these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding 
sequence can originate from a coding sequence for another module of the FK-520 PKS, 
from a coding sequence for a PKS that produces a polyketide other than FK-520, or from
chemical synthesis. The resulting heterologous fifth extender module coding sequence
can be utilized in conjunction with a coding sequence for a PKS that synthesizes FK-520,
an FK-520 derivative, or another polyketide. In similar fashion, the corresponding 
domains in a module of a heterologous PKS can be replaced by one or more domains of 
the fifth extender module of the FK-520 PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant
FK-520 PKS genes in which the coding sequences for the DH domain of the fifth 
extender module have been deleted or mutated to render the DH non-functional. In one 
such mutated gene, the KR and DH coding sequences are replaced with those encoding 
only a KR domain from another PKS gene. The resulting PKS genes code for the
expression of an FK-520 PKS that produces an FK-520 analog that lacks the C-19 to
C-20 double bond of FK-520 and has a C-20 hydroxyl group. Such analogs are preferred
neurotrophins, because they have little or no immunosuppressant activity. This
recombinant fifth extender module coding sequence can be combined with other coding 
sequences to make additional compounds of the invention. In an illustrative embodiment,
the present invention provides a recombinant FK-520 PKS that contains both this fifth
extender module and the recombinant fourth extender module described above that 
comprises the coding sequence for the fourth extender module AT domain of the FK-506 
PKS. The invention also provides recombinant host cells derived from FK-506 
producing host cells that have been mutated to prevent production of FK-506 but that
express this recombinant PKS and so synthesize the corresponding (lacking the C-19 to
C-20 double bond of FK-506 and having a C-20 hydroxyl group) FK-506 derivative. In 
another embodiment, the present invention provides a recombinant FK-506 PKS in 
which the DH domain of module 5 has been deleted or otherwise rendered inactive and 
thus produces this novel polyketide. 

The sixth extender module of the FK-520 PKS includes a KS, an AT specific for
methylmalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA
compounds of the invention that encode the sixth extender module of the FK-520 PKS
and the corresponding polypeptides encoded thereby are useful for a variety of 
applications. In one embodiment, a DNA compound comprising a sequence that encodes
the FK-520 sixth extender module is inserted into a DNA compound that comprises the
coding sequence for a heterologous PKS. The resulting construct, in which the coding 
sequence for a module of the heterologous PKS is either replaced by that for the sixth 
extender module of the FK-520 PKS or the latter is merely added to coding sequences 
for the modules of the heterologous PKS, provides a novel PKS coding sequence. In
another embodiment, a DNA compound comprising a sequence that encodes the sixth
extender module of the FK-520 PKS is inserted into a DNA compound that comprises
the coding sequence for the remainder of the FK-520 PKS or a recombinant FK-520 PKS 
that produces an FK-520 derivative. 

In another embodiment, a portion of the sixth extender module coding sequence 
is utilized in conjunction with other PKS coding sequences to create a hybrid module. In
this embodiment, the invention provides, for example, either replacing the
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting any one, two, or all three of the KR, DH, and
ER; and/or replacing any one, two, or all three of the KR, DH, and ER with another KR,
DH, and ER. In addition, the KS and/or ACP can be replaced with another KS and/or
ACP. In each of these replacements, the heterologous KS, AT, DH, KR, ER, or ACP
coding sequence can originate from a coding sequence for another module of the FK-520 
PKS, from a coding sequence for a PKS that produces a polyketide other than FK-520, or 
from chemical synthesis. The resulting heterologous sixth extender module coding
sequence can be utilized in conjunction with a coding sequence for a PKS that
synthesizes FK-520, an FK-520 derivative, or another polyketide. In similar fashion, the 
corresponding domains in a module of a heterologous PKS can be replaced by one or 
more domains of the sixth extender module of the FK-520 PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant
FK-520 PKS genes in which the coding sequences for the DH and ER domains of the
sixth extender module have been deleted or mutated to render them non-functional. In 
one such mutated gene, the KR, ER, and DH coding sequences are replaced with those 
encoding only a KR domain from another PKS gene. This can also be accomplished by 
simply replacing the coding sequences for extender module six with those for an
extender module having a methylmalonyl specific AT and only a KR domain from a
heterologous PKS gene, such as, for example, the coding sequences for extender module
two encoded by the eryAI gene. The resulting PKS genes code for the expression of an
FK-520 PKS that produces an FK-520 analog that has a C-18 hydroxyl group. Such 
analogs are preferred neurotrophins, because they have little or no immunosuppressant
activity. This recombinant sixth extender module coding sequence can be combined with
other coding sequences to make additional compounds of the invention. In an illustrative 
embodiment, the present invention provides a recombinant FK-520 PKS that contains 
both this sixth extender module and the recombinant fourth extender module described 
above that comprises the coding sequence for the fourth extender module AT domain of
the FK-506 PKS. The invention also provides recombinant host cells derived from
FK-506 producing host cells that have been mutated to prevent production of FK-506 but
that express this recombinant PKS and so synthesize the corresponding (having a C-18 
hydroxyl group) FK-506 derivative. In another embodiment, the present invention 
provides a recombinant FK-506 PKS in which the DH and ER domains of module 6 have 
been deleted or otherwise rendered inactive and thus produces this novel polyketide.

The seventh extender module of the FK-520 PKS includes a KS, an AT specific 
for 2-hydroxymalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA
compounds of the invention that encode the seventh extender module of the FK-520 PKS
and the corresponding polypeptides encoded thereby are useful for a variety of
applications. In one embodiment, a DNA compound comprising a sequence that encodes
the FK-520 seventh extender module is inserted into a DNA compound that comprises 
the coding sequence for a heterologous PKS. The resulting construct, in which the 
coding sequence for a module of the heterologous PKS is either replaced by that for the 
seventh extender module of the FK-520 PKS or the latter is merely added to coding
sequences for the modules of the heterologous PKS, provides a novel PKS coding
sequence. In another embodiment, a DNA compound comprising a sequence that 
encodes the seventh extender module of the FK-520 PKS is inserted into a DNA 
compound that comprises the coding sequence for the remainder of the FK-520 PKS or a 
recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion or all of the seventh extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid 
module. In this embodiment, the invention provides, for example, either replacing the
2-hydroxymalonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or
malonyl CoA specific AT; deleting the KR, the DH, and/or the ER; and/or replacing the
KR, DH, and/or ER. In addition, the KS and/or ACP can be replaced with another KS
and/or ACP. In each of these replacements or insertions, the heterologous KS, AT, DH,
KR, ER, or ACP coding sequence can originate from a coding sequence for another 
module of the FK-520 PKS, from a coding sequence for a PKS that produces a 
polyketide other than FK-520, or from chemical synthesis. The resulting heterologous
seventh extender module coding sequence can be utilized in conjunction with a coding
sequence for a PKS that synthesizes FK-520, an FK-520 derivative, or another 
polyketide. In similar fashion, the corresponding domains in a module of a heterologous 
PKS can be replaced by one or more domains of the seventh extender module of the
FK-520 PKS.

In an illustrative embodiment, the present invention provides a set of recombinant
FK-520 PKS genes in which the coding sequences for the AT domain of the seventh
extender module have been replaced with those encoding an AT domain for malonyl,
methylmalonyl, or ethylmalonyl CoA from another PKS gene. The resulting PKS genes
code for the expression of an FK-520 PKS that produces an FK-520 analog that lacks the
C-15 methoxy group, having instead a hydrogen, methyl, or ethyl group at that position, 
respectively. Such analogs are preferred, because they are more slowly metabolized than 
FK-520. This recombinant seventh extender module coding sequence can be combined 
with other coding sequences to make additional compounds of the invention. In an
illustrative embodiment, the present invention provides a recombinant FK-520 PKS that
contains both this seventh extender module and the recombinant fourth extender module 
described above that comprises the coding sequence for the fourth extender module AT 
domain of the FK-506 PKS. The invention also provides recombinant host cells derived
from FK-506 producing host cells that have been mutated to prevent production of
FK-506 but that express this recombinant PKS and so synthesize the corresponding
(C-15-desmethoxy) FK-506 derivative. In another embodiment, the present invention provides
a recombinant FK-506 PKS in which the AT domain of module 7 has been replaced and 
thus produces this novel polyketide. 

In another illustrative embodiment, the present invention provides a hybrid PKS
in which the AT and KR domains of module 7 of the FK-520 PKS are replaced by a
methylmalonyl specific AT domain and an inactive KR domain, such as, for example, 
the AT and KR domains of extender module 6 of the rapamycin PKS. The resulting 
hybrid PKS produces 15-desmethoxy-15-methyl-16-oxo-FK-520, a neurotrophin
compound. 

The eighth extender module of the FK-520 PKS includes a KS, an AT specific
for 2-hydroxymalonyl CoA, a KR, and an ACP. The recombinant DNA compounds of 
the invention that encode the eighth extender module of the FK-520 PKS and the 
corresponding polypeptides encoded thereby are useful for a variety of applications. In 
one embodiment, a DNA compound comprising a sequence that encodes the FK-520
eighth extender module is inserted into a DNA compound that comprises the coding
sequence for a heterologous PKS. The resulting construct, in which the coding sequence 
for a module of the heterologous PKS is either replaced by that for the eighth extender 
module of the FK-520 PKS or the latter is merely added to coding sequences for the 
modules of the heterologous PKS, provides a novel PKS coding sequence. In another
embodiment, a DNA compound comprising a sequence that encodes the eighth extender
module of the FK-520 PKS is inserted into a DNA compound that comprises the coding
sequence for the remainder of the FK-520 PKS or a recombinant FK-520 PKS that 
produces an FK-520 derivative. 

In another embodiment, a portion of the eighth extender module coding sequence 
is utilized in conjunction with other PKS coding sequences to create a hybrid module. In
this embodiment, the invention provides, for example, either replacing the 2- 
hydroxymalonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or 
malonyl CoA specific AT; deleting or replacing the KR; and/or inserting a DH or a DH 
and an ER. In addition, the KS and/or ACP can be replaced with another KS and/or ACP.
In each of these replacements, the heterologous KS, AT, DH, KR, ER, or ACP coding
sequence can originate from a coding sequence for another module of the FK-520 PKS, 
from a coding sequence for a PKS that produces a polyketide other than FK-520, or from 
chemical synthesis. The resulting heterologous eighth extender module coding sequence 
can be utilized in conjunction with a PKS that synthesizes FK-520, an FK-520
derivative, or another polyketide. In similar fashion, the corresponding domains in a
module of a heterologous PKS can be replaced by one or more domains of the eighth 
extender module of the FK-520 PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant 
FK-520 PKS genes in which the coding sequences for the AT domain of the eighth
extender module have been replaced with those encoding an AT domain for malonyl,
methylmalonyl, or ethylmalonyl CoA from another PKS gene. The resulting PKS genes 
code for the expression of an FK-520 PKS that produces an FK-520 analog that lacks the 
C-13 methoxy group, having instead a hydrogen, methyl, or ethyl group at that position, 
respectively. Such analogs are preferred, because they are more slowly metabolized than
FK-520. This recombinant eighth extender module coding sequence can be combined
with other coding sequences to make additional compounds of the invention. In an 
illustrative embodiment, the present invention provides a recombinant FK-520 PKS that 
contains both this eighth extender module and the recombinant fourth extender module 
described above that comprises the coding sequence for the fourth extender module AT
domain of the FK-506 PKS. The invention also provides recombinant host cells derived
from FK-506 producing host cells that have been mutated to prevent production of FK- 
506 but that express this recombinant PKS and so synthesize the corresponding (C-13- 
desmethoxy) FK-506 derivative. In another embodiment, the present invention provides 
a recombinant FK-506 PKS in which the AT domain of module 8 has been replaced and
thus produces this novel polyketide.
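
The relationship stated above between the specificity of the replacement AT domain and the substituent found in the resulting analog can be sketched as a lookup. This is a minimal illustration only; the names POSITION_BY_MODULE, SUBSTITUENT_BY_AT, and predicted_substituent are hypothetical, and the table covers only the seventh and eighth extender modules and the four extender units discussed in the text.

```python
# Ring position whose substituent is determined by each module's AT domain,
# as stated in the text for extender modules 7 and 8.
POSITION_BY_MODULE = {7: "C-15", 8: "C-13"}

# Substituent found at that position in the final compound for each AT
# specificity. The native 2-hydroxymalonyl AT corresponds to the methoxy
# group of FK-520; the other three are the replacements named in the text.
SUBSTITUENT_BY_AT = {
    "2-hydroxymalonyl": "methoxy",
    "malonyl": "hydrogen",
    "methylmalonyl": "methyl",
    "ethylmalonyl": "ethyl",
}


def predicted_substituent(module, at_substrate):
    """Return (position, substituent) for an AT replacement in module 7 or 8."""
    if module not in POSITION_BY_MODULE:
        raise KeyError(f"no position recorded for module {module}")
    if at_substrate not in SUBSTITUENT_BY_AT:
        raise KeyError(f"unknown AT specificity: {at_substrate}")
    return POSITION_BY_MODULE[module], SUBSTITUENT_BY_AT[at_substrate]
```

For example, predicted_substituent(8, "methylmalonyl") returns ("C-13", "methyl"), corresponding to the C-13-desmethoxy analog bearing a methyl group at that position.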

The ninth extender module of the FK-520 PKS includes a KS, an AT specific for 
methylmalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA 
compounds of the invention that encode the ninth extender module of the FK-520 PKS 
and the corresponding polypeptides encoded thereby are useful for a variety of 
applications. In one embodiment, a DNA compound comprising a sequence that encodes
the FK-520 ninth extender module is inserted into a DNA compound that comprises the 
coding sequence for a heterologous PKS. The resulting construct, in which the coding 
sequence for a module of the heterologous PKS is either replaced by that for the ninth 
extender module of the FK-520 PKS or the latter is merely added to coding sequences
for the modules of the heterologous PKS, provides a novel PKS coding sequence. In
another embodiment, a DNA compound comprising a sequence that encodes the ninth 
extender module of the FK-520 PKS is inserted into a DNA compound that comprises 
the coding sequence for the remainder of the FK-520 PKS or a recombinant FK-520 PKS 
that produces an FK-520 derivative. 

In another embodiment, a portion of the ninth extender module coding sequence
is utilized in conjunction with other PKS coding sequences to create a hybrid module. In
this embodiment, the invention provides, for example, either replacing the
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting any one, two, or all three of the KR, DH, and
ER; and/or replacing any one, two, or all three of the KR, DH, and ER with another KR,
DH, and/or ER. In addition, the KS and/or ACP can be replaced with another KS and/or
ACP. In each of these replacements, the heterologous KS, AT, DH, KR, ER, or ACP
coding sequence can originate from a coding sequence for another module of the FK-520
PKS, from a coding sequence for a PKS that produces a polyketide other than FK-520, or
from chemical synthesis. The resulting heterologous ninth extender module coding
sequence can be utilized in conjunction with a PKS that synthesizes FK-520, an FK-520
derivative, or another polyketide. In similar fashion, the corresponding domains in a
module of a heterologous PKS can be replaced by one or more domains of the ninth
extender module of the FK-520 PKS.

The tenth extender module of the FK-520 PKS includes a KS, an AT specific for
malonyl CoA, and an ACP. The recombinant DNA compounds of the invention that
encode the tenth extender module of the FK-520 PKS and the corresponding
polypeptides encoded thereby are useful for a variety of applications. In one
embodiment, a DNA compound comprising a sequence that encodes the FK-520 tenth
extender module is inserted into a DNA compound that comprises the coding sequence
for a heterologous PKS. The resulting construct, in which the coding sequence for a
module of the heterologous PKS is either replaced by that for the tenth extender module
of the FK-520 PKS or the latter is merely added to coding sequences for the modules of
the heterologous PKS, provides a novel PKS coding sequence. In another embodiment, a
DNA compound comprising a sequence that encodes the tenth extender module of the
FK-520 PKS is inserted into a DNA compound that comprises the coding sequence for
the remainder of the FK-520 PKS or a recombinant FK-520 PKS that produces an
FK-520 derivative.

In another embodiment, a portion or all of the tenth extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid
module. In this embodiment, the invention provides, for example, either replacing the 
malonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or 2- 
hydroxymalonyl CoA specific AT; and/or inserting a KR, a KR and DH, or a KR, DH, 
and an ER. In addition, the KS and/or ACP can be replaced with another KS and/or ACP. 
In each of these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or 
ACP coding sequence can originate from a coding sequence for another module of the 
FK-520 PKS, from a coding sequence for a PKS that produces a polyketide other than 
FK-520, or from chemical synthesis. The resulting heterologous tenth extender module 
coding sequence can be utilized in conjunction with a coding sequence for a PKS that 
synthesizes FK-520, an FK-520 derivative, or another polyketide. In similar fashion the 
corresponding domains in a module of a heterologous PKS can be replaced by one or 
more domains of the tenth extender module of the FK-520 PKS. 
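
The domain swaps described for the ninth and tenth extender modules can be pictured with a small data model. The following is only an illustrative sketch: the `Module` class and the `swap_at` and `set_reductive` helpers are hypothetical names introduced here, and the domain order simply follows the order in which the text lists the domains, which is an assumption and not a statement about their physical arrangement in the polypeptide.

```python
from dataclasses import dataclass, replace
from typing import Tuple

# Listing order used in the text above (an assumption for display purposes only).
REDUCTIVE_ORDER = ("KR", "DH", "ER")


@dataclass(frozen=True)
class Module:
    name: str
    at_substrate: str                # extender unit that the AT domain selects
    reductive: Tuple[str, ...] = ()  # any subset of KR, DH, ER

    def domains(self) -> Tuple[str, ...]:
        # KS and ACP frame the module; the AT and reductive domains sit between them.
        return ("KS", f"AT({self.at_substrate})", *self.reductive, "ACP")


def swap_at(module: Module, substrate: str) -> Module:
    """Return a hybrid module whose AT domain selects a different extender unit."""
    return replace(module, at_substrate=substrate)


def set_reductive(module: Module, domains: Tuple[str, ...]) -> Module:
    """Return a hybrid module carrying exactly the given reductive domains."""
    unknown = set(domains) - set(REDUCTIVE_ORDER)
    if unknown:
        raise ValueError(f"not a reductive domain: {sorted(unknown)}")
    ordered = tuple(d for d in REDUCTIVE_ORDER if d in domains)
    return replace(module, reductive=ordered)


# Domain content of the two modules as stated in the text.
module9 = Module("FK-520 module 9", "methylmalonyl CoA", ("KR", "DH", "ER"))
module10 = Module("FK-520 module 10", "malonyl CoA")

# Module 9: replace the AT and delete DH and ER. Module 10: insert KR and DH.
hybrid9 = set_reductive(swap_at(module9, "malonyl CoA"), ("KR",))
hybrid10 = set_reductive(module10, ("KR", "DH"))

print(hybrid9.domains())
print(hybrid10.domains())
```

The frozen dataclass keeps the parent module unchanged, so each hybrid is a new object, much as each recombinant coding sequence is a construct distinct from the native gene.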

The FK-520 polyketide precursor produced by the action of the tenth extender
module of the PKS is then attached to pipecolic acid and cyclized to form FK-520. The
enzyme FkbP is the NRPS-like enzyme that catalyzes these reactions. FkbP also includes
a thioesterase activity that cleaves the nascent FK-520 polyketide from the NRPS. The
present invention provides recombinant DNA compounds that encode the fkbP gene and
so provides recombinant methods for expressing the fkbP gene product in recombinant
host cells. The recombinant fkbP genes of the invention include those in which the
coding sequence for the adenylation domain has been mutated or replaced with coding
sequences from other NRPS-like enzymes so that the resulting recombinant FkbP
incorporates a moiety other than pipecolic acid. For the construction of host cells that do
not naturally produce pipecolic acid, the present invention provides recombinant DNA
compounds that express the enzymes that catalyze at least some of the biosynthesis of
pipecolic acid (see Nielsen et al., 1991, Biochem. 30: 5789-96). The fkbL gene encodes a
homolog of RapL, a lysine cyclodeaminase responsible in part for producing the
pipecolate unit added to the end of the polyketide chain. The fkbB and fkbL recombinant
genes of the invention can be used in heterologous hosts to produce compounds such as
FK-520 or, in conjunction with other PKS or NRPS genes, to produce known or novel
polyketides and non-ribosomal peptides.

The present invention also provides recombinant DNA compounds that encode
the P450 oxidase and methyltransferase genes involved in the biosynthesis of FK-520.
Figure 2 shows the various sites on the FK-520 polyketide core structure at which these
enzymes act. By providing these genes in recombinant form, the present invention
provides recombinant host cells that can produce FK-520. This is accomplished by
introducing the recombinant PKS, P450 oxidase, and methyltransferase genes into a
heterologous host cell. In a preferred embodiment, the heterologous host cell is
Streptomyces coelicolor CH999 or Streptomyces lividans K4-114, as described in U.S.
Patent No. 5,830,750 and U.S. patent application Serial Nos. 08/828,898, filed 31 Mar.
1997, and 09/181,833, filed 28 Oct. 1998, each of which is incorporated herein by
reference. In addition, by providing recombinant host cells that express only a subset of
these genes, the present invention provides methods for making FK-520 precursor
compounds not readily obtainable by other means.

In a related aspect, the present invention provides recombinant DNA compounds
and vectors that are useful in generating, by homologous recombination, recombinant
host cells that produce FK-520 precursor compounds. In this aspect of the invention, a
native host cell that produces FK-520 is transformed with a vector (such as an SCP2*
derived vector for Streptomyces host cells) that encodes one or more disrupted genes
(i.e., a hydroxylase, a methyltransferase, or both) or merely flanking regions from those
genes. When the vector integrates by homologous recombination, the native, functional
gene is deleted or replaced by the non-functional recombinant gene, and the resulting
host cell thus produces an FK-520 precursor. Such host cells can also be complemented
by introduction of a modified form of the deleted or mutated non-functional gene to
produce a novel compound.
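
The outcome of the gene replacement described above can be sketched as a string operation. This is a toy model under stated assumptions: the `double_crossover` function and all sequences are hypothetical and invented for illustration, the flanks are assumed to occur once each on the chromosome, and real recombination involves selection and screening steps that are not modeled.

```python
def double_crossover(chromosome: str, left_flank: str, right_flank: str,
                     payload: str = "") -> str:
    """Replace the region between two flanking homologies with `payload`.

    An empty payload models a vector carrying only the flanking regions
    (a clean deletion); a non-empty payload models a disrupted gene copy.
    """
    left_at = chromosome.find(left_flank)
    if left_at < 0:
        raise ValueError("left flank not found on chromosome")
    start = left_at + len(left_flank)
    right_at = chromosome.find(right_flank, start)
    if right_at < 0:
        raise ValueError("right flank not found downstream of left flank")
    return chromosome[:start] + payload + chromosome[right_at:]


# Invented toy sequences; none of these are FK-520 cluster sequences.
upstream, downstream = "GGATCC", "GAATTC"
left, right = "AAACCC", "CCCTTT"
functional_gene = "ATGTTTGGGTAA"
disrupted_gene = "ATGTAA"

native = upstream + left + functional_gene + right + downstream
deleted = double_crossover(native, left, right)
replaced = double_crossover(native, left, right, disrupted_gene)

print(deleted)
print(replaced)
```

Calling the same function again with a modified gene as the payload corresponds to the complementation step mentioned at the end of the paragraph.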

In one important embodiment, the present invention provides a hybrid PKS and
the corresponding recombinant DNA compounds that encode those hybrid PKS
enzymes. For purposes of the present invention, a hybrid PKS is a recombinant PKS that
comprises all or part of one or more modules and thioesterase/cyclase domain of a first
PKS and all or part of one or more modules, loading module, and thioesterase/cyclase
domain of a second PKS. In one preferred embodiment, the first PKS is all or part of the
FK-520 PKS, and the second PKS is only a portion or all of a non-FK-520 PKS.

One example of the preferred embodiment is an FK-520 PKS in which the AT
domain of module 8, which specifies a hydroxymalonyl CoA and from which the C-13
methoxy group of FK-520 is derived, is replaced by an AT domain that specifies a
malonyl, methylmalonyl, or ethylmalonyl CoA. Examples of such replacement AT
domains include the AT domains from modules 3, 12, and 13 of the rapamycin PKS
and from modules 1 and 2 of the erythromycin PKS. Such replacements, conducted at
the level of the gene for the PKS, are illustrated in the examples below. Another
illustrative example of such a hybrid PKS includes an FK-520 PKS in which the natural
loading module has been replaced with a loading module of another PKS. Another
example of such a hybrid PKS is an FK-520 PKS in which the AT domain of module
three is replaced with an AT domain that binds methylmalonyl CoA.

In another preferred embodiment, the first PKS is most but not all of a
non-FK-520 PKS, and the second PKS is only a portion or all of the FK-520 PKS. An illustrative
example of such a hybrid PKS includes an erythromycin PKS in which an AT specific
for methylmalonyl CoA is replaced with an AT from the FK-520 PKS specific for
malonyl CoA.

Those of skill in the art will recognize that all or part of either the first or second
PKS in a hybrid PKS of the invention need not be isolated from a naturally occurring
source. For example, only a small portion of an AT domain determines its specificity.
See U.S. provisional patent application Serial No. 60/091,526, incorporated herein by
reference. The state of the art in DNA synthesis allows the artisan to construct de novo
DNA compounds of size sufficient to construct a useful portion of a PKS module or
domain. For purposes of the present invention, such synthetic DNA compounds are
deemed to be a portion of a PKS.

Thus, the hybrid modules of the invention are incorporated into a PKS to provide
a hybrid PKS of the invention. A hybrid PKS of the invention can result not only:

(i) from fusions of heterologous domain (where heterologous means the domains
in that module are from at least two different naturally occurring modules) coding
sequences to produce a hybrid module coding sequence contained in a PKS gene whose
product is incorporated into a PKS,
but also:

(ii) from fusions of heterologous module (where heterologous module means two
modules are adjacent to one another that are not adjacent to one another in naturally
occurring PKS enzymes) coding sequences to produce a hybrid coding sequence
contained in a PKS gene whose product is incorporated into a PKS,

(iii) from expression of one or more FK-520 PKS genes with one or more
non-FK-520 PKS genes, including both naturally occurring and recombinant non-FK-520
PKS genes, and

(iv) from combinations of the foregoing.

Various hybrid PKSs of the invention illustrating these various alternatives are described
herein.

Examples of the production of a hybrid PKS by co-expression of PKS genes from
the FK-520 PKS and another non-FK-520 PKS include hybrid PKS enzymes produced
by coexpression of FK-520 and rapamycin PKS genes. Preferably, such hybrid PKS
enzymes are produced in recombinant Streptomyces host cells that produce FK-520 or
FK-506 but have been mutated to inactivate the gene whose function is to be replaced by
the rapamycin PKS gene introduced to produce the hybrid PKS. Particular examples
include (i) replacement of the fkbC gene with the rapB gene; and (ii) replacement of the
fkbA gene with the rapC gene. The latter hybrid PKS produces
13,15-didesmethoxy-FK-520, if the host cell is an FK-520 producing host cell, and
13,15-didesmethoxy-FK-506, if the host cell is an FK-506 producing host cell. The
compounds produced by these hybrid PKS enzymes are immunosuppressants and
neurotrophins but can be readily modified to act only as neurotrophins, as described in
Example 6, below.

Other illustrative hybrid PKS enzymes of the invention are prepared by replacing
the fkbA gene of an FK-520 or FK-506 producing host cell with a hybrid fkbA gene in
which: (a) the extender module 8 through 10, inclusive, coding sequences have been
replaced by the coding sequences for extender modules 12 to 14, inclusive, of the
rapamycin PKS; and (b) the module 8 coding sequences have been replaced by the
module 8 coding sequence of the rifamycin PKS. When expressed with the other,
naturally occurring FK-520 or FK-506 PKS genes and the genes of the modification
enzymes, the resulting hybrid PKS enzymes produce, respectively, (a)
13-desmethoxy-FK-520 or 13-desmethoxy-FK-506; and (b) 13-desmethoxy-13-methyl-FK-520 or
13-desmethoxy-13-methyl-FK-506. In a preferred embodiment, these recombinant PKS
genes of the invention are introduced into the producing host cell by a vector such as
pHU204, which is a plasmid pRM5 derivative that has the well-characterized SCP2*
replicon, the colE1 replicon, the tsr and bla resistance genes, and a cos site. This vector
can be used to introduce the recombinant fkbA replacement gene in an FK-520 or
FK-506 producing host cell (or a host cell derived therefrom in which the endogenous fkbA
gene has been rendered inactive by mutation, deletion, or homologous
recombination with the gene that replaces it) to produce the desired hybrid PKS.

In constructing hybrid PKSs of the invention, certain general methods may be 
helpful. For example, it is often beneficial to retain the framework of the module to be 
altered to make the hybrid PKS. Thus, if one desires to add DH and ER functionalities to 
a module, it is often preferred to replace the KR domain of the original module with a 
KR, DH, and ER domain-containing segment from another module, instead of merely 
inserting DH and ER domains. One can alter the stereochemical specificity of a module 
by replacement of the KS domain with a KS domain from a module that specifies a 
different stereochemistry. See Lau et al., 1999, "Dissecting the role of acyltransferase
domains of modular polyketide synthases in the choice and stereochemical fate of
extender units," Biochemistry 38(5): 1643-1651, incorporated herein by reference.
Stereochemistry can also be changed by changing the KR domain. Also, one can alter the
specificity of an AT domain by changing only a small segment of the domain. See Lau et
al., supra. One can also take advantage of known linker regions in PKS proteins to link
modules from two different PKSs to create a hybrid PKS. See Gokhale et al., 16 Apr.
1999, "Dissecting and Exploiting Intermodular Communication in Polyketide
Synthases," Science 284: 482-485, incorporated herein by reference.

The following Table lists references describing illustrative PKS genes and 
corresponding enzymes that can be utilized in the construction of the recombinant PKSs 
and the corresponding DNA compounds that encode them of the invention. Also 
presented are various references describing tailoring enzymes and corresponding genes 
that can be employed in accordance with the methods of the present invention. 
Avermectin

U.S. Pat. No. 5,252,474 to Merck.

MacNeil et al., 1993, Industrial Microorganisms: Basic and Applied Molecular
Genetics, Baltz, Hegeman, & Skatrud, eds. (ASM), pp. 245-256, A Comparison of the
Genes Encoding the Polyketide Synthases for Avermectin, Erythromycin, and
Nemadectin.

MacNeil et al., 1992, Gene 115: 119-125, Complex Organization of the
Streptomyces avermitilis genes encoding the avermectin polyketide synthase.

Ikeda et al., Aug. 1999, Organization of the biosynthetic gene cluster for the
polyketide anthelmintic macrolide avermectin in Streptomyces avermitilis, Proc. Natl.
Acad. Sci. USA 96: 9509-9514.
Candicidin (FR008)

Hu et al., 1994, Mol. Microbiol. 14: 163-172.
Epothilone

U.S. Pat. App. Serial No. 60/130,560, filed 22 April 1999.
Erythromycin

PCT Pub. No. 93/13663 to Abbott.

U.S. Pat. No. 5,824,513 to Abbott.

Donadio et al., 1991, Science 253: 675-9.

Cortes et al., 8 Nov. 1990, Nature 348: 176-8, An unusually large
multifunctional polypeptide in the erythromycin producing polyketide synthase of
Saccharopolyspora erythraea.

Glycosylation Enzymes

PCT Pat. App. Pub. No. 97/23630 to Abbott.
FK-506

Motamedi et al., 1998, The biosynthetic gene cluster for the macrolactone ring of
the immunosuppressant FK-506, Eur. J. Biochem. 256: 528-534.

Motamedi et al., 1997, Structural organization of a multifunctional polyketide
synthase involved in the biosynthesis of the macrolide immunosuppressant FK-506, Eur.
J. Biochem. 244: 74-80.

Methyltransferase

US 5,264,355, issued 23 Nov. 1993, Methylating enzyme from
Streptomyces MA6858. 31-O-desmethyl-FK-506 methyltransferase.

Motamedi et al., 1996, Characterization of methyltransferase and
hydroxylase genes involved in the biosynthesis of the immunosuppressants FK-506 and
FK-520, J. Bacteriol. 178: 5243-5248.
Streptomyces hygroscopicus

U.S. patent application Serial No. 09/154,083, filed 16 Sep. 1998.
Lovastatin

U.S. Pat. No. 5,744,350 to Merck.
Narbomycin

U.S. patent application Serial No. 60/107,093, filed 5 Nov. 1998, and Serial No.
60/120,254, filed 16 Feb. 1999.
Nemadectin

MacNeil et al., 1993, supra.
Niddamycin

Kakavas et al., 1997, Identification and characterization of the niddamycin
polyketide synthase genes from Streptomyces caelestis, J. Bacteriol. 179: 7515-7521.
Oleandomycin

Swan et al., 1994, Characterisation of a Streptomyces antibioticus gene encoding
a type I polyketide synthase which has an unusual coding sequence, Mol. Gen. Genet.
242: 358-362.

U.S. patent application Serial No. 60/120,254, filed 16 Feb. 1999.

Olano et al., 1998, Analysis of a Streptomyces antibioticus chromosomal region
involved in oleandomycin biosynthesis, which encodes two glycosyltransferases
responsible for glycosylation of the macrolactone ring, Mol. Gen. Genet. 259(3):
299-308.

Picromycin

PCT patent application US99/15047, filed 2 Jul. 1999.

Xue et al., 1998, Hydroxylation of macrolactones YC-17 and narbomycin is
mediated by the pikC-encoded cytochrome P450 in Streptomyces venezuelae, Chemistry
& Biology 5(11): 661-667.

Xue et al., Oct. 1998, A gene cluster for macrolide antibiotic biosynthesis in
Streptomyces venezuelae: Architecture of metabolic diversity, Proc. Natl. Acad. Sci.
USA 95: 12111-12116.
Platenolide

EP Pat. App. Pub. No. 791,656 to Lilly.
Rapamycin

Schwecke et al., Aug. 1995, The biosynthetic gene cluster for the
polyketide rapamycin, Proc. Natl. Acad. Sci. USA 92: 7839-7843.

Aparicio et al., 1996, Organization of the biosynthetic gene cluster for rapamycin
in Streptomyces hygroscopicus: analysis of the enzymatic domains in the modular
polyketide synthase, Gene 169: 9-16.
Rifamycin

August et al., 13 Feb. 1998, Biosynthesis of the ansamycin antibiotic rifamycin:
deductions from the molecular analysis of the rif biosynthetic gene cluster of
Amycolatopsis mediterranei S669, Chemistry & Biology 5(2): 69-79.
Sorangium PKS

U.S. patent application Serial No. 09/144,085, filed 31 Aug. 1998.
Soraphen

U.S. Pat. No. 5,716,849 to Novartis.

Schupp et al., 1995, J. Bacteriology 177: 3673-3679. A Sorangium cellulosum
(Myxobacterium) Gene Cluster for the Biosynthesis of the Macrolide Antibiotic
Soraphen A: Cloning, Characterization, and Homology to Polyketide Synthase Genes
from Actinomycetes.
Spiramycin

U.S. Pat. No. 5,098,837 to Lilly.

Activator Gene

U.S. Pat. No. 5,514,544 to Lilly.
Tylosin

EP Pub. No. 791,655 to Lilly.

U.S. Pat. No. 5,876,991 to Lilly.

Kuhstoss et al., 1996, Gene 183: 231-6, Production of a novel polyketide
through the construction of a hybrid polyketide synthase.
Tailoring enzymes

Merson-Davies and Cundliffe, 1994, Mol. Microbiol. 13: 349-355. Analysis of
five tylosin biosynthetic genes from the tylBA region of the Streptomyces fradiae
genome.

As the above Table illustrates, there are a wide variety of polyketide synthase
genes that serve as readily available sources of DNA and sequence information for use in
constructing the hybrid PKS-encoding DNA compounds of the invention. Methods for
constructing hybrid PKS-encoding DNA compounds are described without reference to
the FK-520 PKS in PCT patent publication No. 98/51695; U.S. Patent Nos. 5,672,491
and 5,712,146 and U.S. patent application Serial Nos. 09/073,538, filed 6 May 1998, and
09/141,908, filed 28 Aug 1998, each of which is incorporated herein by reference.

The hybrid PKS-encoding DNA compounds of the invention can be and often are
hybrids of more than two PKS genes. Moreover, there are often two or more modules in
the hybrid PKS in which all or part of the module is derived from a second (or third)
PKS. Thus, as one illustrative example, the present invention provides a hybrid FK-520
PKS that contains the naturally occurring loading module and FkbP as well as modules
one, two, four, six, seven, eight, nine, and ten of the FK-520 PKS and further
contains hybrid or heterologous modules three and five. Hybrid or heterologous module
three contains an AT domain that is specific for methylmalonyl CoA and can be derived,
for example, from the erythromycin or rapamycin PKS genes. Hybrid or heterologous
module five contains an AT domain that is specific for malonyl CoA and can be derived,
for example, from the picromycin or rapamycin PKS genes.

While an important embodiment of the present invention relates to hybrid PKS
enzymes and corresponding genes, the present invention also provides recombinant
FK-520 PKS genes in which there is no second PKS gene sequence present but which differ
from the FK-520 PKS gene by one or more deletions. The deletions can encompass one
or more modules and/or can be limited to a partial deletion within one or more modules.
When a deletion encompasses an entire module, the resulting FK-520 derivative is at
least two carbons shorter than the compound produced by the PKS from which it was
derived. When a deletion is within a module, the deletion typically encompasses a KR,
DH, or ER domain, or both DH and ER domains, or both KR and DH domains, or all
three KR, DH, and ER domains.

To construct a hybrid PKS or FK-520 derivative PKS gene of the invention, one
can employ a technique, described in PCT Pub. No. 98/27203 and U.S. patent
application Serial No. 08/989,332, filed 11 Dec. 1997, each of which is incorporated
herein by reference, in which the large PKS gene is divided into two or more, typically
three, segments, and each segment is placed on a separate expression vector. In this
manner, each of the segments of the gene can be altered, and various altered segments
can be combined in a single host cell to provide a recombinant PKS gene of the
invention. This technique makes more efficient the construction of large libraries of
recombinant PKS genes, vectors for expressing those genes, and host cells comprising
those vectors.

Thus, in one important embodiment, the recombinant DNA compounds of the
invention are expression vectors. As used herein, the term expression vector refers to any
nucleic acid that can be introduced into a host cell or cell-free transcription and
translation medium. An expression vector can be maintained stably or transiently in a
cell, whether as part of the chromosomal or other DNA in the cell or in any cellular
compartment, such as a replicating vector in the cytoplasm. An expression vector also
comprises a gene that serves to produce RNA that is translated into a polypeptide in the
cell or cell extract. Furthermore, expression vectors typically contain additional
functional elements, such as resistance-conferring genes to act as selectable markers.

The various components of an expression vector can vary widely, depending on
the intended use of the vector. In particular, the components depend on the host cell(s) in
which the vector will be used or is intended to function. Vector components for
expression and maintenance of vectors in E. coli are widely known and commercially
available, as are vector components for other commonly used organisms, such as yeast
cells and Streptomyces cells.

In a preferred embodiment, the expression vectors of the invention are used to
construct recombinant Streptomyces host cells that express a recombinant PKS of the
invention. Preferred Streptomyces host cell/vector combinations of the invention include
S. coelicolor CH999 and S. lividans K4-114 host cells, which do not produce
actinorhodin, and expression vectors derived from the pRM1 and pRM5 vectors, as
described in U.S. Patent No. 5,830,750 and U.S. patent application Serial Nos.
08/828,898, filed 31 Mar. 1997, and 09/181,833, filed 28 Oct. 1998, each of which is
incorporated herein by reference.

The present invention provides a wide variety of expression vectors for use in
Streptomyces. For replicating vectors, the origin of replication can be, for example and
without limitation, a low copy number vector, such as SCP2* (see Hopwood et al.,
Genetic Manipulation of Streptomyces: A Laboratory Manual (The John Innes
Foundation, Norwich, U.K., 1985); Lydiate et al., 1985, Gene 35: 223-235; and Kieser
and Melton, 1988, Gene 65: 83-91, each of which is incorporated herein by reference),
SLP1.2 (Thompson et al., 1982, Gene 20: 51-62, incorporated herein by reference), and
SG5(ts) (Muth et al., 1989, Mol. Gen. Genet. 219: 341-348, and Bierman et al., 1992,
Gene 116: 43-49, each of which is incorporated herein by reference), or a high copy
number vector, such as pIJ101 and pJV1 (see Katz et al., 1983, J. Gen. Microbiol. 129:
2703-2714; Vara et al., 1989, J. Bacteriol. 171: 5782-5781; and Servin-Gonzalez, 1993,
Plasmid 30: 131-140, each of which is incorporated herein by reference). Generally,
however, high copy number vectors are not preferred for expression of genes contained
on large segments of DNA. For non-replicating and integrating vectors, it is useful to
include at least an E. coli origin of replication, such as from pUC, p1P, p1I, and pBR.
For phage based vectors, the phages phiC31 and KC515 can be employed (see Hopwood
et al., supra).

Typically, the expression vector will comprise one or more marker genes by
which host cells containing the vector can be identified and/or selected. Useful antibiotic
resistance conferring genes for use in Streptomyces host cells include the ermE (confers
resistance to erythromycin and other macrolides and lincomycin), tsr (confers resistance
to thiostrepton), aadA (confers resistance to spectinomycin and streptomycin), aacC4
(confers resistance to apramycin, kanamycin, gentamicin, geneticin (G418), and
neomycin), hyg (confers resistance to hygromycin), and vph (confers resistance to
viomycin) resistance conferring genes.
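
The marker genes listed above lend themselves to a simple lookup when planning selection. The sketch below only restates the gene-to-antibiotic pairs given in the text; the `markers_for` and `selectable_on` helper names are hypothetical conveniences and are not part of the invention.

```python
# Resistance genes and the antibiotics they confer resistance to, as listed above.
RESISTANCE = {
    "ermE": ("erythromycin", "other macrolides", "lincomycin"),
    "tsr": ("thiostrepton",),
    "aadA": ("spectinomycin", "streptomycin"),
    "aacC4": ("apramycin", "kanamycin", "gentamicin", "geneticin (G418)", "neomycin"),
    "hyg": ("hygromycin",),
    "vph": ("viomycin",),
}


def markers_for(antibiotic: str) -> list:
    """Marker genes from the list that confer resistance to `antibiotic`."""
    return sorted(gene for gene, drugs in RESISTANCE.items() if antibiotic in drugs)


def selectable_on(vector_markers) -> list:
    """Antibiotics usable to select Streptomyces cells carrying these markers.

    Markers absent from the list above (for example an E. coli marker) are ignored.
    """
    return sorted({drug for gene in vector_markers for drug in RESISTANCE.get(gene, ())})


print(markers_for("kanamycin"))
print(selectable_on(["tsr", "bla"]))
```

For a vector carrying tsr and bla, only tsr appears in the Streptomyces list above, so the lookup reports thiostrepton as the selection available in that host.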

The recombinant PKS gene on the vector will be under the control of a promoter,
typically with an attendant ribosome binding site sequence. The present invention
provides the endogenous promoters of the FK-520 PKS and related biosynthetic genes in
recombinant form, and these promoters are preferred for use in the native hosts and in
heterologous hosts in which the promoters function. A preferred promoter of the
invention is the fkbO gene promoter, comprised in a sequence of about 270 bp between
the start of the open reading frames of the fkbO and fkbB genes. The fkbO promoter is
believed to be bi-directional in that it promotes transcription of the genes fkbO, fkbP, and
fkbA in one direction and fkbB, fkbC, and fkbL in the other. Thus, in one aspect, the
present invention provides a recombinant expression vector comprising the promoter of
the fkbO gene of an FK-520 producing organism positioned to transcribe a gene other
than fkbO. In a preferred embodiment, the transcribed gene is an FK-520 PKS gene. In
another preferred embodiment, the transcribed gene is a gene that encodes a protein
comprised in a hybrid PKS.

Heterologous promoters can also be employed and are preferred for use in host
cells in which the endogenous FK-520 PKS gene promoters do not function or function
poorly. A preferred heterologous promoter is the actI promoter and its attendant activator
gene actII-ORF4, which is provided in the pRM1 and pRM5 expression vectors, supra.
This promoter is activated in the stationary phase of growth when secondary metabolites
are normally synthesized. Other useful Streptomyces promoters include without
limitation those from the ermE gene and the melC1 gene, which act constitutively, and
the tipA gene and the merA gene, which can be induced at any growth stage. In addition,
the T7 RNA polymerase system has been transferred to Streptomyces and can be
employed in the vectors and host cells of the invention. In this system, the coding
sequence for the T7 RNA polymerase is inserted into a neutral site of the chromosome or
in a vector under the control of the inducible merA promoter, and the gene of interest is
placed under the control of the T7 promoter. As noted above, one or more activator
genes can also be employed to enhance the activity of a promoter. Activator genes in
addition to the actII-ORF4 gene discussed above include dnrI, redD, and ptpA genes (see
U.S. patent application Serial No. 09/181,833, supra) to activate promoters under their
control.

In addition to providing recombinant DNA compounds that encode the FK-520
PKS, the present invention also provides DNA compounds that encode the enzymes for
biosynthesis of the ethylmalonyl CoA and 2-hydroxymalonyl CoA utilized in the
synthesis of FK-520. Thus, the present invention also provides recombinant host cells
that express the genes required for the biosynthesis of ethylmalonyl CoA and
2-hydroxymalonyl CoA. Figures 3 and 4 show the location of these genes on the cosmids
of the invention and the biosynthetic pathway that produces ethylmalonyl CoA.

For 2-hydroxymalonyl CoA biosynthesis, the fkbH, fkbI, fkbJ, and fkbK genes are
sufficient to confer this ability on Streptomyces host cells. For conversion of
2-hydroxymalonyl to 2-methoxymalonyl, the fkbG gene is also employed. While the
complete coding sequence for fkbH is provided on the cosmids of the invention, the
sequence for this gene provided herein may be missing a T residue, based on a
comparison made with a similar gene cloned from the ansamitocin gene cluster by Dr. H.
Floss. Where the sequence herein shows one T, there may be two, resulting in an
extension of the fkbH reading frame to encode the amino acid sequence:

MTIVKCLVWDLDNTLWRGTVLEDDEVVLTDEIREVITTLDDRGILQAVASKNDH 
DLAW^ERLERLGVAEYFVLARIGWGPKSQSVREIATELNFAPTTIAFIDDQPAERA 
EVAFHLPEVRCYPAEOAATLLSLPEFSPPVSTVDSRRRRLMYQAGFARDQ.\REA 
YSGPDEDFLRSLDLSMTIAPAGEEELSRVEELTLRTSQMNATGVHYSDADLRAL
LTDPAHEVLVVTMGDRFGPHGAVGIILLEKXPSTXVTiLKLLATSCRVVSFGAGAT
ILNWLTDQGARAGAHLVADFRRTDRNRMMEIAYRFAGFADSDCPCVSEVAGAS 
AAGVERLHLEPSARPAPTTLTLTAADIAPVTVSAAG. 
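
The effect of a single missing T on a reading frame can be illustrated with a short translation routine. This is a sketch under stated assumptions: the DNA strings below are invented toy sequences and are not the fkbH sequence, the standard genetic code is assumed, and the `translate` helper is a hypothetical name.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, built by enumerating codons in TCAG order.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate from position 0 until the first stop codon or the end of the sequence."""
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Toy sequence as "read" with a single T (hypothetical, not fkbH).
one_t = "ATGGATCCGGAACTGTAACGTGGCTATGA"
# The same sequence with a second T inserted after the first one.
two_t = one_t[:6] + "T" + one_t[6:]

print(translate(one_t))
print(translate(two_t))
```

In this toy case the single-T reading stops after five residues, while the two-T reading continues in a different frame to a later stop codon, which is the kind of reading frame extension the paragraph above describes.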

For ethylmalonyl CoA biosynthesis, one requires only a crotonyl CoA reductase,
which can be supplied by the host cell but can also be supplied by recombinant
expression of the fkbS gene of the present invention. To increase yield of ethylmalonyl
CoA, one can also express the fkbE and fkbU genes. While such production can
be achieved using only the recombinant genes above, one can also achieve such
production by placing into the recombinant host cell a large segment of the DNA
provided by the cosmids of the invention. Thus, for 2-hydroxymalonyl and
2-methoxymalonyl CoA biosynthesis, one can simply provide the cells with the segment of
DNA located on the left side of the FK-520 PKS genes shown in Figure 1. For
ethylmalonyl CoA biosynthesis, one can simply provide the cells with the segment of
DNA located on the right side of the FK-520 PKS genes shown in Figure 1 or,
alternatively, both the right and left segments of DNA.

The recombinant DNA expression vectors that encode these genes can be used to
construct recombinant host cells that can make these important polyketide building
blocks from cells that otherwise are unable to produce them. For example, Streptomyces
coelicolor and Streptomyces lividans do not synthesize ethylmalonyl CoA or
2-hydroxymalonyl CoA. The invention provides methods and vectors for constructing
recombinant Streptomyces coelicolor and Streptomyces lividans that are able to
synthesize either or both ethylmalonyl CoA and 2-hydroxymalonyl CoA. These host
cells are thus able to make polyketides, those requiring these substrates, that cannot
otherwise be made in such cells.

In a preferred embodiment, the present invention provides recombinant
Streptomyces host cells, such as S. coelicolor and S. lividans, that have been transformed
with a recombinant vector of the invention that codes for the expression of the
ethylmalonyl CoA biosynthetic genes. The resulting host cells produce ethylmalonyl
CoA and so are preferred host cells for the production of polyketides produced by PKS
enzymes that comprise one or more AT domains specific for ethylmalonyl CoA.
Illustrative PKS enzymes of this type include the FK-520 PKS and a recombinant PKS in
which one or more AT domains is specific for ethylmalonyl CoA.

In a related embodiment, the present invention provides Streptomyces host cells
in which one or more of the ethylmalonyl or 2-hydroxymalonyl biosynthetic genes have
been deleted by homologous recombination or rendered inactive by mutation. For
example, deletion or inactivation of the fkbG gene can prevent formation of the methoxyl
groups at C-13 and C-15 of FK-520 (or, in the corresponding FK-506 producing cell,
FK-506), leading to the production of 13,15-didesmethoxy-13,15-dihydroxy-FK-520
(or, in the corresponding FK-506 producing cell,
13,15-didesmethoxy-13,15-dihydroxy-FK-506). If the fkbG gene product acts on
2-hydroxymalonyl and the resulting 2-methoxymalonyl substrate is required for
incorporation by the PKS, the AT domains of modules 7 and 8 may bind malonyl CoA and
methylmalonyl CoA. Such incorporation results in the production of a mixture of
polyketides in which the methoxy groups at C-13 and C-15 of FK-520 (or FK-506) are
replaced by either hydrogen or methyl.

This possibility of non-specific binding is suggested by the construction of a hybrid
PKS of the invention in which the AT domain of module 8 of the FK-520 PKS replaced
the AT domain of module 6 of DEBS. The resulting PKS produced, in Streptomyces
lividans, 6-dEB and 2-desmethyl-6-dEB, indicating that the AT domain of module 8 of
the FK-520 PKS could bind malonyl CoA and methylmalonyl CoA substrates. Thus, one
could possibly also prepare the 13,15-didesmethoxy-FK-520 and corresponding FK-506
compounds of the invention by deleting or otherwise inactivating one or more or all of
the genes required for 2-hydroxymalonyl CoA biosynthesis, i.e., the fkbH, fkbI, fkbJ, and
fkbK genes. In any event, the deletion or inactivation of one or more biosynthetic genes
required for ethylmalonyl and/or 2-hydroxymalonyl production prevents the formation of
polyketides requiring ethylmalonyl and/or 2-hydroxymalonyl for biosynthesis, and the
resulting host cells are thus preferred for production of polyketides that do not require
the same.

The host cells of the invention can be grown and fermented under conditions
known in the art for other purposes to produce the compounds of the invention. See, e.g.,
U.S. Patent Nos. 5,194,378; 5,116,756; and 5,494,820, incorporated herein by reference,
for suitable fermentation processes. The compounds of the invention can be isolated
from the fermentation broths of these cultured cells and purified by standard procedures.
Preferred compounds of the invention include the following compounds:
13-desmethoxy-FK-506; 13-desmethoxy-FK-520; 13,15-didesmethoxy-FK-506;
13,15-didesmethoxy-FK-520; 13-desmethoxy-18-hydroxy-FK-506;
13-desmethoxy-18-hydroxy-FK-520; 13,15-didesmethoxy-18-hydroxy-FK-506; and
13,15-didesmethoxy-18-hydroxy-FK-520. These compounds can be further modified as
described for tacrolimus and FK-520 in U.S. Patent Nos. 5,225,403; 5,189,042; 5,164,495;
5,068,323; 4,980,466; and 4,920,218, incorporated herein by reference.

Other compounds of the invention are shown in Figure 8, Parts A and B. In
Figure 8, Part A, illustrative C-32-substituted compounds of the invention are shown in
two columns under the heading R. The substituted compounds are preferred for topical
administration and are applied to the dermis for treatment of conditions such as psoriasis.
In Figure 8, Part B, illustrative reaction schemes for making the compounds shown in
Figure 8, Part A, are provided. In the upper scheme in Figure 8, Part B, the C-32
substitution is a tetrazole moiety, illustrative of the groups shown in the left column
under R in Figure 8, Part A. In the lower scheme in Figure 8, Part B, the C-32
substitution is a disubstituted amino group, where R3 and R4 can be any group similar to
the illustrative groups shown attached to the amine in the right column under R in Figure
8, Part A. While Figure 8 shows the C-32-substituted compounds in which the
C-15-methoxy is present, the invention includes these C-32-substituted compounds in
which C-15 is ethyl, methyl, or hydrogen. Also, while C-21 is shown as substituted with
ethyl or allyl, the compounds of the invention include the C-32-substituted compounds in
which C-21 is substituted with hydrogen or methyl.

To make these C-32-substituted compounds, Figure 8, Part B, provides
illustrative reaction schemes. Thus, a selective reaction of the starting compound (see
Figure 8, Part B, for an illustrative starting compound) with trifluoromethanesulfonic
anhydride in the presence of a base yields the C-32 O-triflate derivative, as shown in the
upper scheme of Figure 8, Part B. Displacement of the triflate with 1H-tetrazole or
triazole derivatives provides the C-32 tetrazole or triazole derivative. As shown in the
lower scheme of Figure 8, Part B, reacting the starting compound with
p-nitrophenylchloroformate yields the corresponding carbonate, which, upon displacement
with an amino compound, provides the corresponding carbamate derivative.

The compounds can be readily formulated to provide the pharmaceutical 
compositions of the invention. The pharmaceutical compositions of the invention can be 
used in the form of a pharmaceutical preparation, for example, in solid, semisolid, or 
liquid form. This preparation contains one or more of the compounds of the invention as 
an active ingredient in admixture with an organic or inorganic carrier or excipient
suitable for external, enteral, or parenteral application. The active ingredient may be
compounded, for example, with the usual non-toxic, pharmaceutically acceptable carriers
for tablets, pellets, capsules, suppositories, solutions, emulsions, suspensions, and any 
other form suitable for use. Suitable formulation processes and compositions for the 
compounds of the present invention are described with respect to tacrolimus in U.S. 
Patent Nos. 5,939,427; 5,922,729; 5,385,907; 5,338,684; and 5,260,301, incorporated 
herein by reference. Many of the compounds of the invention contain one or more chiral 
centers, and all of the stereoisomers are included within the scope of the invention, as
pure compounds as well as mixtures of stereoisomers. Thus the compounds of the 
invention may be supplied as a mixture of stereoisomers in any proportion. 

The carriers which can be used include water, glucose, lactose, gum acacia, 
gelatin, mannitol, starch paste, magnesium trisilicate, talc, corn starch, keratin, colloidal 
silica, potato starch, urea, and other carriers suitable for use in manufacturing 
preparations, in solid, semi-solid, or liquified form. In addition, auxiliary stabilizing, 
thickening, and coloring agents and perfumes may be used. For example, the compounds 
of the invention may be utilized with hydroxypropyl methylcellulose essentially as
described in U.S. Patent No. 4,916,138, incorporated herein by reference, or with a 
surfactant essentially as described in EPO patent publication No. 428,169, incorporated 
herein by reference. 

Oral dosage forms may be prepared essentially as described by Hondo et al.,
1987, Transplantation Proceedings XIX, Supp. 6: 17-22, incorporated herein by
reference. Dosage forms for external application may be prepared essentially as
described in EPO patent publication No. 423,714, incorporated herein by reference. The
active compound is included in the pharmaceutical composition in an amount sufficient
to produce the desired effect upon the disease process or condition.

For the treatment of conditions and diseases relating to immunosuppression or
neuronal damage, a compound of the invention may be administered orally, topically,
parenterally, by inhalation spray, or rectally in dosage unit formulations containing
conventional non-toxic pharmaceutically acceptable carriers, adjuvants, and vehicles. The
term parenteral, as used herein, includes subcutaneous injections, and intravenous,
intramuscular, and intrasternal injection or infusion techniques.

Dosage levels of the compounds of the present invention are of the order from
about 0.01 mg to about 50 mg per kilogram of body weight per day, preferably from
about 0.1 mg to about 10 mg per kilogram of body weight per day. The dosage levels are
useful in the treatment of the above-indicated conditions (from about 0.7 mg to about
3.5 g per patient per day, assuming a 70 kg patient). In addition, the compounds of the
present invention may be administered on an intermittent basis, i.e., at semi-weekly,
weekly, semi-monthly, or monthly intervals.
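
Per-patient amounts follow from multiplying a per-kilogram dose by body weight. The short sketch below works through that arithmetic for the ranges quoted above; the function name and the 70 kg default are illustrative assumptions, not part of the specification. For a 70 kg patient, 0.01 to 50 mg per kilogram per day corresponds to about 0.7 mg to 3500 mg (3.5 g) per day.

```python
def per_patient_mg(dose_mg_per_kg, body_weight_kg=70):
    """Scale a daily dose in mg/kg to a daily dose in mg for one patient.

    The 70 kg default mirrors the patient weight assumed in the text.
    """
    return round(dose_mg_per_kg * body_weight_kg, 3)


# Ranges quoted in the text, in mg per kilogram of body weight per day.
BROAD_RANGE = (0.01, 50)
PREFERRED_RANGE = (0.1, 10)

for label, (low, high) in (("broad", BROAD_RANGE), ("preferred", PREFERRED_RANGE)):
    print(f"{label}: {per_patient_mg(low)} mg to {per_patient_mg(high)} mg per day")
```

The calculation is only a unit conversion; as the text goes on to note, the dose for any particular patient depends on many other factors.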

The amount of active ingredient that may be combined with the carrier materials
to produce a single dosage form will vary depending upon the host treated and the
particular mode of administration. For example, a formulation intended for oral
administration to humans may contain from 0.5 mg to 5 g of active agent compounded
with an appropriate and convenient amount of carrier material, which may vary from
about 5 percent to about 95 percent of the total composition. Dosage unit forms will
generally contain from about 0.5 mg to about 500 mg of active ingredient. For external
administration, the compounds of the invention can be formulated within the range of,
for example, 0.00001% to 60% by weight, preferably from 0.001% to 10% by weight,
and most preferably from about 0.005% to 0.8% by weight. The compounds and
compositions of the invention are useful in treating disease conditions using doses and
administration schedules as described for tacrolimus in U.S. Patent Nos. 5,542,436;
5,365,948; 5,348,966; and 5,196,437, incorporated herein by reference. The compounds
of the invention can be used as single therapeutic agents or in combination with other
therapeutic agents. Drugs that can be usefully combined with compounds of the
invention include one or more immunosuppressant agents such as rapamycin,
cyclosporin A, FK-506, or one or more neurotrophic agents.

It will be understood, however, that the specific dosage level for any particular
patient will depend on a variety of factors. These factors include the activity of the
specific compound employed; the age, body weight, general health, sex, and diet of the
subject; the time and route of administration and the rate of excretion of the drug;
whether a drug combination is employed in the treatment; and the severity of the
particular disease or condition for which therapy is sought.

A detailed description of the invention having been provided above, the
following examples are given for the purpose of illustrating the present invention and
shall not be construed as being a limitation on the scope of the invention or claims.



Example 1

Replacement of Methoxyl with Hydrogen or Methyl at C-13 of FK-520

The C-13 methoxyl group is introduced into FK-520 via an AT domain in
extender module 8 of the PKS that is specific for hydroxymalonyl and by methylation of
the hydroxyl group by an S-adenosyl methionine (SAM) dependent methyltransferase.
Metabolism of FK-506 and FK-520 primarily involves oxidation at the C-13 position
into an inactive derivative that is further degraded by host P450 and other enzymes. The
present invention provides compounds related in structure to FK-506 and FK-520 that do
not contain the C-13 methoxy group and exhibit greater stability and a longer half-life in
vivo. These compounds are useful medicaments due to their immunosuppressive and
neurotrophic activities, and the invention provides the compounds in purified form and
as pharmaceutical compositions.

The present invention also provides the novel PKS enzymes that produce these
novel compounds as well as the expression vectors and host cells that produce the novel
PKS enzymes. The novel PKS enzymes include, among others, those that contain an AT
domain specific for either malonyl CoA or methylmalonyl CoA in module 8 of the
FK-506 and FK-520 PKS. This example describes the construction of recombinant DNA
compounds that encode the novel FK-520 PKS enzymes and the transformation of host
cells with those recombinant DNA compounds to produce the novel PKS enzymes and
the polyketides produced thereby.

To construct an expression cassette for performing module 8 AT domain
replacements in the FK-520 PKS, a 4.6 kb SphI fragment from the FK-520 gene cluster
was cloned into plasmid pLitmus 38 (a cloning vector available from New England
Biolabs). The 4.6 kb SphI fragment, which encodes the ACP domain of module 7
followed by module 8 through the KR domain, was isolated from an agarose gel after
digesting the cosmid pKOS65-C31 with SphI. The clone having the insert oriented so
the single SacI site was nearest to the SpeI end of the polylinker was identified and
designated as plasmid pKOS60-21-67. To generate appropriate cloning sites, two linkers
were ligated sequentially as follows. First, a linker was ligated between the SpeI and
SacI sites to introduce a BglII site at the 5' end of the cassette, to eliminate interfering
polylinker sites, and to reduce the total insert size to 4.5 kb (the limit of the phage
KC515). The ligation reactions contained 5 picomolar unphosphorylated linker DNA and
0.1 picomolar vector DNA, i.e., a 50-fold molar excess of linker to vector. The linker had
the following sequence:

5'-CTAGTGGGCAGATCTGGCAGCT-3'
    3'-ACCCGTCTAGACCG-5'

The resulting plasmid was designated pKOS60-27-1.

Next, a linker of the following sequence was ligated between the unique SphI and
AflII sites of plasmid pKOS60-27-1 to introduce an NsiI site at the 3' end of the module
8 cassette. The linker employed was:

    5'-GGGATGCATGGC-3'
3'-GTACCCCTACGTACCGAATT-5'

The resulting plasmid was designated pKOS60-29-55.
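
The design of the two linkers can be checked by annealing the strands in silico and reading off the unpaired ends. The sketch below is an illustration only: the helper name `duplex_ends` is hypothetical, and the strand sequences are entered as read from the text above (top strand 5' to 3', bottom strand 3' to 5'). The recognition sequences used (BglII AGATCT, NsiI ATGCAT) and the overhangs left by SpeI (5'-CTAG), SacI (AGCT-3'), SphI (CATG-3') and AflII (5'-TTAA) are standard properties of those enzymes.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def duplex_ends(top_5to3, bottom_3to5):
    """Anneal two linker strands and return their unpaired ends, written 5'->3'.

    top_5to3 is written 5'->3' and bottom_3to5 is written 3'->5', as printed.
    Returns (left_end, right_end) of whichever strand is longer.
    """
    if len(top_5to3) >= len(bottom_3to5):
        # The complement of the bottom strand is the paired stretch of the top.
        paired = bottom_3to5.translate(COMPLEMENT)
        start = top_5to3.find(paired)
        return top_5to3[:start], top_5to3[start + len(paired):]
    # Otherwise the bottom strand carries the overhangs; reverse them so
    # that they read 5'->3' like the enzyme overhangs they must match.
    paired = top_5to3.translate(COMPLEMENT)
    start = bottom_3to5.find(paired)
    return bottom_3to5[:start][::-1], bottom_3to5[start + len(paired):][::-1]


# First linker: SpeI/SacI ends, introduces a BglII site.
LINKER1_TOP = "CTAGTGGGCAGATCTGGCAGCT"
LINKER1_BOTTOM = "ACCCGTCTAGACCG"
# Second linker: SphI/AflII ends, introduces an NsiI site.
LINKER2_TOP = "GGGATGCATGGC"
LINKER2_BOTTOM = "GTACCCCTACGTACCGAATT"

print(duplex_ends(LINKER1_TOP, LINKER1_BOTTOM))  # ('CTAG', 'AGCT')
print(duplex_ends(LINKER2_TOP, LINKER2_BOTTOM))  # ('CATG', 'TTAA')
print("AGATCT" in LINKER1_TOP, "ATGCAT" in LINKER2_TOP)  # True True
```

Under these readings, each linker carries single-stranded ends compatible with the two sites it is ligated between, together with the new site it is meant to introduce.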

To allow in-frame insertions of alternative AT domains, sites were engineered at
the 5' end (AvrII or NheI) and 3' end (XhoI) of the AT domain using the polymerase
chain reaction (PCR) as follows. Plasmid pKOS60-29-55 was used as a template for the
PCR and sequence 5' to the AT domain was amplified with the primers SpeBgl-fwd and
either Avr-rev or Nhe-rev:

SpeBgl-fwd  5'-CGACTCACTAGTGGGCAGATCTGG-3'
Avr-rev     5'-CACGCCTAGGCCGGTCGGTCTCGGGCCAC-3'
Nhe-rev     5'-GCGGCTAGCTGCTCGCCCATCGCGGGATGC-3'
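
Each primer name indicates the restriction sites it is meant to carry. As a quick check, the following sketch scans the three primers for the relevant recognition sequences. The primer sequences are entered as read from the listing above, and the function name `find_sites` is a hypothetical helper; the recognition sequences themselves are the standard ones for these enzymes. All four sites are palindromic, so a site found in the primer is also present on the complementary strand.

```python
# Standard recognition sequences for the enzymes named in the text.
RECOGNITION_SITES = {
    "SpeI": "ACTAGT",
    "BglII": "AGATCT",
    "AvrII": "CCTAGG",
    "NheI": "GCTAGC",
}

# Primer sequences, 5'->3', as read from the listing above.
PRIMERS = {
    "SpeBgl-fwd": "CGACTCACTAGTGGGCAGATCTGG",
    "Avr-rev": "CACGCCTAGGCCGGTCGGTCTCGGGCCAC",
    "Nhe-rev": "GCGGCTAGCTGCTCGCCCATCGCGGGATGC",
}


def find_sites(primer):
    """Return {enzyme: 0-based position} for every site present in the primer."""
    return {
        enzyme: primer.find(site)
        for enzyme, site in RECOGNITION_SITES.items()
        if site in primer
    }


for name, sequence in PRIMERS.items():
    print(name, find_sites(sequence))
```

With these sequences, SpeBgl-fwd carries both a SpeI and a BglII site, while Avr-rev and Nhe-rev each carry only the site after which they are named.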

The PCR included, in a 50 µl reaction, 5 µl of 10x Pfu polymerase buffer
(Stratagene), 5 µl of 10x z-dNTP mixture (2 mM dATP, 2 mM dCTP, 2 mM dTTP, 1 mM
dGTP, 1 mM 7-deaza-GTP), 5 µl of DMSO, 2 µl of each primer (10 µM), 1 µl of template
DNA (0.1 µg/µl), and 1 µl of cloned Pfu polymerase (Stratagene). The PCR conditions
were 95°C for 2 min., 25 cycles at 95°C for 30 sec., 60°C for 30 sec., and 72°C for 4
min., followed by 4 min. at 72°C and a hold at 0°C. The amplified DNA products and
the Litmus vectors were cut with the appropriate restriction enzymes (BglII and AvrII or
SpeI and NheI), and cloned into either pLitmus 28 or pLitmus 38 (New England Biolabs),
respectively, to generate the constructs designated pKOS60-37-4 and pKOS60-37-2,
respectively.
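
The reaction recipe and cycling program above can be tallied with a few lines of arithmetic. The sketch below rests on two assumptions that the text does not state outright: that the polymerase volume is 1 µl, and that the remainder of the 50 µl reaction is made up with water. The variable names are illustrative only.

```python
REACTION_VOLUME_UL = 50

# Component volumes in microlitres, as listed in the text.
COMPONENTS_UL = {
    "10x Pfu polymerase buffer": 5,
    "10x z-dNTP mixture": 5,
    "DMSO": 5,
    "forward primer (10 uM)": 2,
    "reverse primer (10 uM)": 2,
    "template DNA (0.1 ug/ul)": 1,
    "cloned Pfu polymerase": 1,  # assumption: volume taken to be 1 ul
}

# Assumption: the balance of the reaction volume is water.
water_ul = REACTION_VOLUME_UL - sum(COMPONENTS_UL.values())

# Final concentrations after dilution into the 50 ul reaction.
primer_final_um = 10 * 2 / REACTION_VOLUME_UL   # each primer, in uM
dmso_percent = 100 * 5 / REACTION_VOLUME_UL     # DMSO, % v/v

# Cycling program: 2 min at 95 C, then 25 cycles of 30 s / 30 s / 4 min,
# then a final 4 min at 72 C (ramp times and the final hold are ignored).
CYCLES = 25
cycle_seconds = 30 + 30 + 4 * 60
total_seconds = 2 * 60 + CYCLES * cycle_seconds + 4 * 60

print(f"water: {water_ul} ul")
print(f"primer: {primer_final_um} uM each, DMSO: {dmso_percent}% v/v")
print(f"programmed time: {total_seconds // 60} min")
```

On these assumptions the listed components account for 21 µl of the reaction, and the programmed steps add up to 131 minutes.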

Plasmid pKOS60-29-55 was again used as a template for PCR to amplify 

sequence 3' to the AT domain using the primers BsrXho-fwd and NsiAfl-rev: 

BsrXho-fwd S'-GATGTACAGCTCGAGTCGGCACGCCCGGCCGCATCO 1 

NsiAfl-rev 5'-CGACTCACTTAAGCCATGCATCC-3'

PCR conditions were as described above. The PCR fragment was cut with BsrGI
and AflII, gel isolated, and ligated into pKOS60-37-4 cut with Asp718 and AflII and
inserted into pKOS60-37-2 cut with BsrGI and AflII, to give the plasmids pKOS60-39-1
and pKOS60-39-13, respectively. These two plasmids can be digested with AvrII and
XhoI or NheI and XhoI, respectively, to insert heterologous AT domains specific for
malonyl, methylmalonyl, ethylmalonyl, or other extender units.

Malonyl and methylmalonyl-specific AT domains were cloned from the
rapamycin cluster using PCR amplification with a pair of primers that introduce an AvrII
or NheI site at the 5' end and an XhoI site at the 3' end. The PCR conditions were as
given above and the primer sequences were as follows:

RATN1    5'-ATCCTAGGCGGGCRGGYGTGTCGTCCTTCGG-3'
(3' end of Rap KS sequence and universal for malonyl and methylmalonyl CoA),
RATMN2   5'-ATGCTAGCCGCCGCGTTCCCCGTCTTCGCGCG-3'
(Rap AT shorter version 5'-sequence and specific for malonyl CoA),
RATMMN2  5'-ATGCTAGCGGATTCGTCGGTGGTGTTCGCCGA-3'
(Rap AT shorter version 5'-sequence and specific for methylmalonyl CoA), and
RATC     5'-ATCTCGAGCCAGTASCGCTGGTGYTGGAAGG-3'
(Rap DH 5'-sequence and universal for malonyl and methylmalonyl CoA).
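
The "universal" primers contain degenerate positions written with IUPAC ambiguity codes (R = A or G, Y = C or T, S = C or G), so each is really a small pool of sequences. The sketch below expands one such primer into its members. It assumes the reading of RATN1 given here, in which the base completing the AvrII site (CCTAGG) is G; the function name `expand_degenerate` is a hypothetical helper.

```python
from itertools import product

# IUPAC nucleotide codes needed for the primers in this example.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",  # purine
    "Y": "CT",  # pyrimidine
    "S": "CG",  # strong pair
}


def expand_degenerate(primer):
    """List every unambiguous sequence represented by a degenerate primer."""
    choices = (IUPAC[base] for base in primer)
    return ["".join(combo) for combo in product(*choices)]


# RATN1 as read from the listing (assumption: position 7 is G, giving CCTAGG).
RATN1 = "ATCCTAGGCGGGCRGGYGTGTCGTCCTTCGG"

variants = expand_degenerate(RATN1)
print(len(variants))  # 4
for variant in variants:
    print(variant)
```

With two two-fold degenerate positions the pool has four members, which is consistent with the stated expectation that a single primer can prime several closely related rapamycin modules.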
[Diagram: primer positions on any Rap module. N1 (AvrII) lies at the 3' end of the KS
domain, MN2 and MMN2 (NheI) at the 5' end of the AT domain, and C (XhoI) downstream
of the AT domain.]

Because of the high sequence similarity in each module of the rapamycin cluster,
each primer was expected to prime any of the AT domains. PCR products representing
ATs specific for malonyl or methylmalonyl extenders were identified by sequencing
individual cloned PCR products. Sequencing also confirmed that the chosen clones
contained no cloning artifacts. Examples of hybrid modules with the rapamycin AT12
and AT13 domains are shown in a separate figure.

The AvrII-XhoI restriction fragment that encodes module 8 of the FK-520 PKS
with the endogenous AT domain replaced by the AT domain of module 12 of the
rapamycin PKS has the DNA sequence and encodes the amino acid sequence shown
below. The AT of rap module 12 is specific for incorporation of malonyl units.

AGATCTGGCAGCTCGCCGAAGCGCTGCTGACGCTCGTCCGGGAGAGCACC 50 

IWQLAEALLTLVREST 
GGGGGCGTGCTCGGCCACGTGGGTGGCGAGGAChrCCCCGCGACGGCGGC 100 

AAVLGHVGGEDI ? A T A A 
GTTCA.AGGACCTGGGCATCGACTCGCTCACCGCGGTCCAGGTGCGCAACG 150 
FKDLGIDSLTAVQLRN 
C * C A.l, v^G AGuCG ACCGGTGTGCGGCT G AA.CGC CAGGGCGoTCTTCG.- G 2 00 
ALT EATGVRLNATAVFD 
TTCCCGACCCCGCACGTGCTCGCCGGGAAGCTCGGCGACGAACTGACCGG 2 50 

FPTPHVLAGKLGDELTG 
CAC C CGC GCGCCCGTCG TGCCCCGGACGGCG GCC AC GGCCGGTGGGC AC G 3 00 

T R A PVVPRTAATAGAH 
ACGAGCCGCTGGCGATCGTGGGAATGGCCTGCCGGCTGCCCGGCGGGGTC 3 50 
0 E P L A I V G M A C P. L P G G V 
GCGTCACCCGAGGAGCTGTGGCACCTCGTGGCATCCGGCACCGACGCCAT 4 00 

ASPEELWHLVASGTDAI 
CACGGAGTTCCCGACGGACCGCGGCTGGGA.CGTCGACGCGATCTACGA.ee 4 50 

TEFPTDRGWDVDAIYD 
CGGACCCCGACGCGATCGGCAAGACCTTCGTCCGGCACGGTGGCTTCCTG 500 
PDPDAIGKTFVRHGGFL 
ACCGGCGCGACAGGCTTCGACGCGGCGTTCTTCGGCATCAGCCCGCGCGA 550 

TGATGFDAAFFG I SPRE 
GGCCCTCGCGATGGACCCGCAGCAGCGGGTGCTCCTGGAGACGTCGTGGG 600 

ALAMDPQQRVLLETSW 
AGGCGTTCGAAAGCGCCGGCATCACCCCGGACTCGACCCGCGGCAGCGAC 650 
EAFESAG ITPDST rgsd 
ACCGGCGTGTTCGTCGGCGCCTTCTCCTACGGTTACGGCACCGGTGCGGA 7 00 

TGVFVGAFSYGYGTGAD 
CACCGACGGCTTCGGCGCGACCGGCTCGCAGACCAGTGTGCTCTCCGGCC 7 50 

TDGFGATGSQTSVLSG 
GGCTGTCGTACTTCTACGGTCTGGAGGGTCCGGCGGTCACGGTCGACACG 8 00 
RLSYFYGLEGPAVTVDT 
GCGTGTTCGTCGTCGCTGGTGGCGCTGCACCAGGCCGGGCAGTCGCTGCG 8 50 
ACSSSLVALHQAGQSLR 
CTCCGGCGAATGCTCGCTCGCCCTGGTCGGCGGCGTCACGGTGATGGCGT 900

5 G E C S L A L V G G V 7 V K A 
C7CCCGGCGGCTTCGTGGAG77 C7CCCGGCAGCGCGGCCTCGCGCCGGAC 950 
G FGGFVEFSRQRGLAPD 
5 GGCCGGGCGAAGGCG77CGGCGCGGGTGCGGACGGCACGAGC77CGCCGA 1000 
G R A K A FGAGADGTS F A £ 
GGGTGCGGGTG7GCTGA7CGTCGAGAGGC7C7CCGAGGCCGAACGCAACG 10 50 

G A G V L I V E R L S D A E R N 
G 7 C A CACCG TCCVGGCGGTCGTCCG TGGTT CGGCGGT C A A C CAGG A 7G G T HOC 
10 G H T V L A V V R G S A V N Q D G 

oG C7CC AA.GGG GCTGTCGGC G GGG AACG^GGGGT G^GA^G A-j : -GGGT ^r-.T 1 - 5 0 

ASNGLSAPNGPSQER V* I 
CCGGCAGGCCCTGGCCAACGCCGGGCTCACGCCGGCGGACGTGGACGCCG 1200 
RQALANAGLTPAD V D A 
1 5 AGGAGGGCGAGGGCACCGGCAGCAGGGTGGGCGAGGCGATGGAGGOAGAG 12 50 
VEAHGTGTRLGDPI E A Q 

^ GT AC7GGGC AGC7 ACGG A'^AuGAb^uCGC -■■-■^ ■ — - o-o o ^3^0 
A V L A 7 Y G Q E R A 7 P 1, L L G 
C7CGCTGAAG7CCAACATCGGCCACGCCCAGGCCGCG7CCGGCG7CGCCG 1350 
20 SLKSNIGHAQAASGVA 

^ G ATGATC A-r^'o ATGGTGCAGGGl.GTGGGGv_,AC'oG oGAuGCl G^ w^^^^r^o i4 00 
GIIKMVQALRHGELPPT 
CTGCACGCCGACGAGCCGTCGCCGCACGTCGACTGGACGGCCGGCGGCGT 1-15 0 
7 HAD E PS P H V 0 W T A G A V 
25 CGAACTGCTGACGTCGCCCCGGCCGTGGCCCGAGACCGACCGGCGTAGGC 1500 
ELLTSARPWPETDRPR 
GGGCAGGCGTGTCGTCCTTCGGGATCAG7GGCACCAACGCCCACGTCATC 15 5 0 
R A G V S S F G I S G T N A K V I 
C7GGAAAGCGCACCCCCCACTCAGCCTGCGGACAACGCGGTGATCGAGCG 160 0 
30 GESAP PTQPADNAVIER 

GGCACCGGAG7GGG7GCCG77GG7GA777GGGCCAGGACCGAG7CGGC77 i 650 

APEWVPLVISARTQSA 
TGAC7GAGCACGAGGGCCGG77GCG7GCG7A7CTGGCGGCG7CGCCCGGG i / 0 0 
LTEKEGRLRAYLAASPG 
35 G7GGATATGCGGGGTGTGGCATCGACGCTGGCGATGACACGGTCGGTGT7 17 50 
VDMRAVASTLAMTRSVF 
C G AG G AG C G 7 G G G G7G G T G C T G G GAG AT GAG AC C G T C AC CG G GAG C GG T G 13 00 

EHRAVLLGDDTVTGTA 
7GTCTGACCCTCGGGCGGTG7TCGTCTTGCCGGGACAGGGG7CGCAGCGT 18 50 
40 V SCPRAVrVFFGQGSQR 

GCTGGCATGGGTGAGGAACTGGCCGCCGCGT7CCCCGTCTTCGCGCGGA7 1900 

AGMGEELAAAFPVFARI 
CCATCAGCAGGTGTGGGACCTGCTCGA7GTGCCCGATCTGGAGGTGAACG 1950 
HQQVWDLLDVPDLEVN 
45 AGACCGGTTACGCCCAGCCGGCCCTGTTCGCAATGCAGGTGGCTCTGT7C 2 00 0 
ETGYAQPALFAMQVALF 
GGGCTGCTGGAATCGTGGGGTGTACGACCGGACGCGGTGATCGGCCA77C 20 5 0 

GLLESWGVRPDAVIGKS 
GG7GGGTGAGCTTGCGGCTGCGTATGTG7CCGGGG7GTGGTCGT7GGAGG 2100 
50 VGELAAAYVSGVWSLE 

ATGCCTGCACTTTGGTGTCGGCGCGGGC7CGTCTGATGCAGGCTC7GCCC 2150 
DACTLVSARARLMQALP 
GCGGGTGGGGTGhTGGTGGCTGTCCCGG^CTCGGAGGMGAGGCCCGGGC 2200 
AGGVMVAVPVSE DEARA 
55 CGTGCTGGGTGAGGGTGTGGAGATCGCCGCGGTCAACGGCCCGTCGTCGG 22 50 
VLGEGVEIAAVNGPSS 
TGGTTCTCTCCGGTGATGAGGCCGCCGTGCTGCAGGCCGCGGAGGGGC7G 2300 
VVLSG DEAAVLQAAEGL 
GGG^AGTGGACGCCGCTGG CGACC A GGCACGCGT7CCAT7CC GCCCG 7 A7 2350 
60 GKWTRLATSHAFHSARM 

GGAACCCA7GCTGGAGGAGTTCCGGGCGGTCGCCGAAGGCCTGACCTACC 24 00 

EPMLEEERAVAEGL7Y 
GGACGCCGCAGGTCTCCATGGCCG7TGGTGATCAGGTGACCACCGCTGAG 2 4 50 
RTPQVSMAVGDQVTTAE 
TAGTGGG7GCGGCAGGTCGGGGACACGGTCGGGTTCGGCGAGCAGGTGGC 2 500

V . W V R Q V R D T V R F G F Q V A 
CI CG'"ACGAG GACGCCG7 G77GG7CGAjGC7GGG7 -jCCGA-CCGGTCACI GG 2 5 50 

3 Y £ D A V F V E L G A D R S G 
G G G G C C T G G T G G A C G G T G 7 C G 2 G A T G C T G C A 7 G G C ■ j A G G ^ G o AA_k 7 C - ^ 2 600 
A R L V D G V A M L H G C H F I Q 
GCCGCGATCGGCGCCC7 1 GGCCCACCTGTA 1 TGTG.*-_kCGGCGTCACGG j. v.G~. 2 6 50 

A A I G A L A H L Y V N G V T V D 
CTGGCCCGCGCTCCTGGGCGATGCTCCGGCAACACGGGTGCTGGACCTTC 27 00 

W P A L L G D A P A T R V L D L 
C G AG A 7 AG G G C T T C G AG G A G C .AG C G G 7 A C 7 G G G 7 G G A G 7' G G G C A ZGZZZ G 2 7 50 
P T YAFQHQRY W L E S A R ? 
G^ GG : „ A 7 CCG ACGCGGoCG AG C 1 ^ G G . G G 7 j j i j _ j. ^ o jTr.a Lw^^C. ~ o „ ^800 

A A S D A G H ? V 7 G 3 G I A I A 
G G G G 7 G G 7 G ^ : ~ ~ ^ G " G G T G 7 7 G AG G G G 7 T G C G 7 G G G G A G 7 3 G 1' G C G G A G G 2 3 !j 0 

G G . R V F T G S V P T 3 7 

GCuC^ w7GTTG^TGGGGoA^sC7 _;Gl>GCT^G ^^vj^Cu^G'jrt^u^'j'j j. ^or\^ 3 900 
R A V F V A E L A L A A A DA V D 
TGCGCCACGGTCGAGCGGCTCGACATCGCCTCCGTGCCCGGCCGGCGGGG 2 9 50 

G A T V E R L D I A S V P G R P G 
CCATGGGCGGAGGhCCGrAChGACG7GGG7CGACGAGCCGGCGGhGGhCG 3000 

H G R T T V Q T W V D F P A D D 
GCCGGCGCCGGTTCACCGTGCACACCCGCACCGGCGACGCCCCGTGGACG 3050 
G R R R F T V H T R 7 G 0 A P W T 
GTGCACGCCGAGGGGGTGCTGCGCCCCCATGGCACGGCCCTGCCCGATGC 3100 

GHAEGVLRPKGTALPDA 
GGCCGACGCCGAGTGGCCCCCACCGGGCGCGGTGCCCGCGGACGGGCTGC 315 0 

ADAEWPPPGAVPADGL 
CGGGTGTGTGGCGCCGGGGGGACCAGGTCTTGGCCGAGGCCGAGGTGGAG 32 00 
PGVWRRGDQVFAEAEVD 
GGACCGGACGGTTTCGTGGTGCACCCCGACCTGCTCGACGCGGTCTTCTC 32 50 

GPDGFVVHPDFLDAVFS 
CGCGGTCGGCGACGGAAGCCGCCAGCCGGCCGGATGGCGCGACCTGACGG 3300 

AVGDGSRQPAGWRDLT 
TGCACGCGTCGGACGCCACCGTACTGCGCGCCTGCCTCACCCGGCGCA.ee 33 5 0 
V H A S DATVLRACL TRRT 
GACGGAGCCATGGGATTCGCCGCCTTCGACGGCGCCGGCCTGCCGG7ACT 34 00 

DGAMGFAAFDGAGLPVL 
CACCGCGGAGGCGGTGACGCTGCGGGAGGTGGCGTCACCGTCCGGCTCCG 3 4 50 

TAEAVTLREVAS P S G S 
AGGAGTCGGACGGCCTGCACCGGTTGGAGTGGCTCGCGGTCGCCGAGGCG 3500 
FES D G L H R L E W L A V A E A 
GTCTACGACGGTGACCTGCCCGAGGGACATGTCCTGATCACCGCCGCCCA 3550 

VYDGDLPEGHVL I TAAH 
CCCCGACGACCCCGAGGACATACCCACCCGCGCCCACACCCGCGCCACCC 3 600 

PDDPEDI PTRAHTRAT 
GCGTCCTGACCGCCCTGCAACACCACCTCACCACCACCGACCACACCCTC 36 50 
RVLTALQHHLTTa DHTL 
ATCGTCCACACCACCACCGACCCCGCCGGCGCCACCGTCACCGGCCTCAC 37 00 

IVHTTTDPAGATVTGLT 
CCGCACCGCCCAGAACGAACACCCCCACCGCATCCGCCTCATCGAAACCG 37 50 

RTAQNEHPKRIRLIET 
ACCACCCCCACACCCCCCTCCCCCTGGCCCAACTCGCCACCCTCGACCAC 38 00 
DHPHTPLPLAQLATLDH 
CCCCACCTCCGCCTCACCCACCACACCCTCCACCACCCCCACCTCACCCC 38 50 

PHLRLTHHTLHH PHLTP 
CCTCCACACCACCACCCCACCCACCACCACCCCCCTCAACCCCGAACACG 3900 

LHTTTPPTTTPLNPEH 
CCATCATCATCACCGGCGGC7CCGGCACCCTCGCCGGCATCCTCGCCCGC 3950 
AIIITGGSGTLAGILAR 
CACCTGAACCACCCCCACACCTACCTCCTCTCCCGCACCCCACCCCCCGA 4 000 

HLNHPHTYLLSRTPPPD 
CGCCACCCCCGGCACCCACCTCCCCTGCGACGTCGGCGACCCCCACCAAC 4 050 
AT PGTHL PCDVG DPHQ 
TCGCCACCACCCTCACCCACATCCCCCAACCCCTCACCGCCATCTTCCAC 4100
L A T T L T H I ? Q P L T A I F H 

accgccgccaccctcgacga c g g c a t c c t c c a g g g g g 7 c a c g g g g g a c c g 4 150 

taatlddgzlhaltfdr 
cgtg^cgaggg7CG7Gcaggggaaagggaaggggg:g7Ggcagc7Ggagc 4200 

l t t v lhp k a n a a w h l h 
acctcacggaaaaccaacccgtgagggacttcgtgcgctactccaoggcg 4 250 
h l t q n q p l t h f v l y s s a 
g c c g c c g t c c t c g g c a g c c c c g g a c aa g g aaa g 7 a c g c c g g g g c g a v3 g g 4 300 

aavlgspgqgny a a a n a 
c t t c c 7 c g a c g g g c t c g g c a g g c a g g g g g a g a g g g 7 g g g g c aa g g c g c c a 4 350 

F L D A L A T K R H T 7 G Q ? A 
CGTCeATCGGGTGGGGGATGTGGGAGAGGAGGAGGAGCGTGAGCGGAGAA 4 4 00 
7 S I A W G M W H 7 7 S 7 L 7 G Q 
G7CGAGGACGGCGACGGGG AGGGGA7GGGGC->GG jCoG iTI G'~7^C_ i 4 4 50 
LDDADP. DRIP, RGGFL? I 
M C o G A C G A C G A G G -o G A 7 G G G G A 7 G G A 7 
T D D E G 
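
The listing above pairs each line of DNA with the amino acids it encodes. As a consistency check on the reading frame, the sketch below translates the first line of the listing with the standard genetic code. The function name `translate` and the choice of an offset of 2 (reading frame starting at the third nucleotide, just inside the BglII site) are working assumptions inferred from the first line of the listing, not statements from the text.

```python
from itertools import product

# Standard genetic code, built in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna, offset=0):
    """Translate complete codons of dna, starting at the given 0-based offset."""
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)


# First 50 nucleotides of the module 12 hybrid fragment, as listed above.
FIRST_LINE = "AGATCTGGCAGCTCGCCGAAGCGCTGCTGACGCTCGTCCGGGAGAGCACC"

print(translate(FIRST_LINE, offset=2))  # IWQLAEALLTLVREST
```

The result matches the amino acid line printed beneath the first DNA line, which supports reading the fragment in that frame.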

The AvrII-XhoI restriction fragment that encodes module 8 of the FK-520 PKS
with the endogenous AT domain replaced by the AT domain of module 13 (specific for
methylmalonyl CoA) of the rapamycin PKS has the DNA sequence and encodes the
amino acid sequence shown below.

AGATG7GGGAGG7GGGCGAAGGGG7GGTGACGGTGGTGCGGGAGAGGAGG 5 0 

QLAEALLTLVRES7 
GCCGCCGTGCTCoGCCACGTGGGTGGCGAGGACATCCCCGCGACGGCGGC 100 

AAVLGHVGGEDI P A 7 A A 
GTTCAAGGACCTCGGCATCGACTCGCTCACCGCGGTCCAGCTGCGCAACG 150 

F K D L G I D S L T A V Q L R N 
CCCTCACCGAGGCGACCGGTGTGCGGCTGAACGCCACGGCGC-rCTTCGAC 200 
A L T E A T G V R L N A T A V F D 
T7CCGGAGGCGGGAGG7GCTGGGGGGGAAGGTGGGCGAGGAAG7GACGGG 250 

FPTPHVLAGKLGDELTG 
CACCCGCGCGCCCGTCGTGCCCCGGACCGCGGCCACGGCCGGTGCGCACG 300 

7RAPVVPRTAA7AGAH 
ACGAGCCGCTGGCGATCG7GGGAATGGCCTGCCGGCTGCCCGGCGGGGTC 3 50 
D F P I A I V G M A C ?. L P G G V 
GCGTCACCCGAGGACCTGTGGCACCTCGTGGCATCCGGCACCGACGCCAT 4 00 

A S P E E L W H L V A S G T D A I 
CACGGAGTTCCCGACGGACCGCGGCTGGGACGTCGACGCGATCTACGACC 4 50 

TEFPTDRGWDVDAI YD 
CGGACCCCGACGCGATCGGCAAGACCTTCGTCCGGCACGGTGGCTTCCTC 500 
P DPDAI GKTFVRHGGFL 
ACCGGCGCGACAGGCTTCGACGCGGCGTTCTTCGGCATCAGCCCGCGCGA 550 

TGATGFDAAFFG'SPRE 
GGCCCTCGCGATGGACCCGCAGCAGCGGGTGCTCCTGGAGACGTCGTGGG 600 

ALAMDPQQRVLLETSW 
AGGCGT7CGAAAGCGCCGGCATCACCCCGGACTCGACCCGCGGCAGCGAC 650 
EAFESAGI7 PDSTRGS D 
ACCGGCGTGTTCGTCGGCGCC7TCTCCTACGGTTACGGCACCGGTGCGGA 7 00 

TGVFVGAFSYGYGTGAD 
CACCGACGGCTTCGGCGCGACCGGCTCGCAGACCAGTGTGCTCTCCGGCC 7 50 

TDGFGA7GSQTSVLSG 
GGCTGTCGTACTTCTACGGTC7GGAGGGTCCGGGGGTCACGGTCGACACG 8 00 
RLSYFYGLEG PAVTVDT 
GCGTGTTCGTCGTCGCTGGTGGCGCTGCACCAGGCCGGGCAGTCGCTGCG 8 50 

ACSSS L.VALHQAGQSLR 
CTCCGGCGAATGCTCGCTCGCCCTGGTCGGCGGCGTCACGGTGATGGCGT 900 

SGECS LALVGGVTVMA 
CTCCCGGCGGCTTCGTGGAGT7C7GCCGGCAGCGGGGCCTCGCGCCGGAC 950 
SPGGFVEFSRQRGLAPD

'•j ^ ; - 'j o '-j G o r-_"jo o ^cTTCG GCG ~ ■ -j G - ~ - 'o 'o v ^ ..7 o G rvG o Ao CTTCGCl^.^ 1000 

G R A K A F G A G A D G T S F A E 
GGG7G7GGG7GTGGTGA7GG7GGAGAGGG7G7GCGAGG 3GGAAGGGAAGG 1 C 50 
5 G A G V L I V* £ P, L 3 G A E R N 

G H T V L A V V R G S A V N Q D G 
GGG7GGAAGGGGCTG7GGGGGGGGAAGGGGGGG7GGCAGGAG3GGGTGA7 1150 
A S M G L 5 A P N GPSQERVI 
10 GGGGGAGGGGC7GGGGAACGGCGGGC7GAGGGGGGGGGAGGTGGACGGGG 1200 
R Q A L A M A G L T P A D V D A 
TCGAGGGGGACGGCAGCGGCAGGAGGG7GGGGGAGGGGATGGAGG0ACAG I 2 5 0 
V E A H G 7 G T R L G D F I FAQ 
v_j i _oo . A.G rG'^GCACC , AGGGAGAGGAGGGGGGGAuGGGGG7GG7GGl Gu'L: j_ j ^ : c 
15 A V L A T V G Q F R A 7 ? G G L G 

-.-T'^o^T j.-u'ioT GG A„a G.*%T C GGCC.-iC GC G G AG G G G'-jG G - GGG-GGG •'CG'G.,G _ ~ r^o 

3 L K S N I G H A Q A A £ G 7 A 
GCAl GA7GAAuGA7GG7GGAGGGCGTGGGGGAG^G ^o^vOG GGGGGGGAG^ ^ 4 00 
GIIKMVQALRHGELPPT 
20 G i GC ACG^'^GAiCoAGCCoTCGCC'oCACGTwGACT oGACGGCCGoCGCC^o 1 * -1 o'J 
L K A D E P S P H V D W T A G A V 
GGAAGTGGTGAGGTCGGCCCGGCCGTGGCCGGAGACGGAGCGGGGGAGGG 1500 

ELL T GAR? W PET D R P R 
uoG^'o^G^lG GTCGTCGTTCGGAGTCAGCvjGCAGGAA.CGGCGAGG7Crv7G 50 
25 R A G V S £ F G V S G T N A H V I 

G7GGAGAGGGGAGCGGCGGG7GAGGGGGGGGAGGAGGCGCAGCGTG77GA 1 6 0 0 

L E S A P P A Q P A E F A Q P V E 
GAGGGGGG7GG7GGGCTGGGATGTGGTGGCGGTGG7GATA.TGGGGGAAGA 16 50 
TPVVASDVLPLVISAK 
30 CCCAGCCCGCCCTGACCGAACACGAAGACCGGCTGCGCGCCTACCTGGCG 17 00 
TQPALTEHEDRLRAYLA 
GCGTCGCCCGGGGCGGATATACGGGCTGTGGCATCGACGCTGGCGGTGAC 17 50 

A S ? G A D I R A V A S T L A V T 
ACGGTCGGTGTTCGAGCACCGCGCCGTACTCCTTGGAGA7GACACCGTCA 18 00 
35 R S V F E H R A V L L G D D T V 

CCGGCAGGGCGGTGAGCGACCCGAGGATGG7GTTTG7 G777CCCGGGCAG 1 S 5 0 
7 G 7 A V 7 D P R I V F V F P G Q 
GGGTGGCAG7GGCTGGGGATGGGCAGTGGAG7GGGCGA7TCG7CGG7GG7 19C0 
GWQWLGMGSALRDSSVV 
40 o7TGGGGGAGGGGATGGGCGA i G7GTGGGGGGGCo77GG'oGoA,GT7CG7GG 1 i j 5 0 
FAERMAECAAALRE F V 
ACTGGGA7GTGT7CAGGG7TC7GGA7GATGCGGGGGTGG7GGACCGGGTT 2000 
0 W D L F T V L D D P A V V D R V 
GATGTGG7CCAGCCCGC7TCCTGGGCGATGATGGTTTCCCTGGCCGCGGT 2 0 50 
45 D V V Q P A S W A M M V S L A A V 

GTGGCAGGCGGCCGGTGTGCGGCCGGATGCGGTGATCGGCCATTCGCAGG 2100 

WQAAGVRPDAVIGHSQ 
GTGAGATCGCCGCAGCTTGTGTGGCGGGTGCGGTGTCACTACGCGATGCC 2150 
GEIAAACVAGAVSLRDA 
50 GCCCGGATCGTGACCTTGCGCAGCCAGGCGATCGCCCGGGGCCTGGCGGG 22 00 
ARI VTLRSQAIARG LAG 
CCGGGGCGCGATGGCATCCGTCGCCCTGCCCGCGCAGGATGTCGAGCTGG 22 5 0 

RGAKASVALPAQDVEL 
TCGACGGGGCCTGGATCGCCGCCCACAACGGGCCCGCCTCCACCGTGATC 23 00 
55 VDGAWIAAKN GPASTVI 

GCGGGCACCCCGGAAGCGGTCGACCATGTCCTCACCGCTCATGAGGCACA 2350 

AGTPEAVDHVLTAHEAQ 
AGGGGTGCGGGTGCGGCGGATCACCGTCGACTATGCCTCGCACACCCCGC 2 4 0 0 
GVRVRRI TVDYASHTP 
60 ACC-7CGAGCTGATCCGGGA3GAAGTAGTGGACATGACTAGCGACAGCAGC 2 4 50 
HVELIRDELLDITSDSS 
TCGCAGACCCCGC7CG7GCCG7GGC7G7CGACCGTGGACGGCACCTGGGT 2500 

SQTPLVPWLSTVDGTWV 
CGACAGCCCGCTGGACGGGGAGTACTGGTACCGGAACCTGCGTGAACCGG 2550 
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DSPLDGEYWYRNLREP 
TCGGT7TCCACCCCGCCG7CAGCCAGT7C7AGGCCCAGGGGGACACCGTG 2 60 0 
V -G F H P A V S Q Q A Q G D 7 V 

T 7 C G 7 C G A G G T C A G C G C C A G C C C G G T G 7 7 G 7 7 G C A G G C G A T G G AC G A C G A 2 650 
5 F V F Y S A S ? 7 L L Q A M D D D 

V V T 7 A T L R R < G G D A T R 
7GC rC AGv-o — ^ — TG v,j ^ A^CAG^CC 7 A7^ T _ G.~-iCGG i ~ ^>7 G^-^C^TC GAC '* ^'-^ 2 7 5 0 
M L T A L A Q A V V H G V T V D W 
10 C C C G C C A 7 C C 7 C G G C A C C A C C A C AA C C C G G G 7 A C 7 G G A C C 7 ? C C G A C C 7 A 2 8 00 
PAI LGTTTTRVLDLPTY 
CG CC 7 ? C C\AAC AC C AG C GG 7 AC 7 G G C T C GAG T C G G C A C G C CC G G CC G CAT 2 850 

A F Q H Q R Y W L E S A R P A A 
CCGACGCGGGCC ACCCCGTGC7GGGCTCCGG7A7CGCCC7CGCCGGG7CG 2 300 
15 S D A G H P V L G S G I A 7 A G S 

C C G G G G C G G G 7 G 7 T C A C G G G 7 7 C 7 G 7 G 2 C G A 2 C G G 7 G G G G A C G G C G C G G 7 2 950 

? G ?. V F T G S V P 7 G A D R A 7 
GTTCG . ■„ o-.^voAGCTGGCGG7GGCGGGGGCG ■^ACG'" GG~.C7G'~GG^2-.. j 000 
F V A E L A L A A A D A 7 D C A 
20 CGGTCGAGCGGC7CGA.CA7CGCC7CCG7GCGCGGCCGGCCGGGCCATGG G 5 050 
TVERLDIASVPGRPGHG 
C G G A C G A C C G 7 A C A G A C C 7 G G G 7 C G A C G A G C C G G C G G A CGACGGCCGG C G 310 0 

R 7 T 7 Q T W V D E P A D D G R R 
CCGG7TCACGG7GCACACCCGCACGGGGGACGGGCCG7GGACGC7GCACG 2150 
25 R F 7 V H T R T G D A P W T L H 

CCGAGGGGGTGC7GCGCCCCCATGGCACGGGCCTGCCCGATGCGGGCGAC 3200 
AEGV7RFHGTALPDAAD 
CCCGAGTGGCCCCCACCGGGCGCGGTGCCCGCGGACGGGCTGCCGGGTGT 32 5 0 
AEWPPPGAVPADGLPGV 
30 GTGGCGCCGGGGGGACCAGGTCT7CGCCGAGGCCGAGGTGGACGGACCGG 3 300 
WRRGDQVFAEAEVDGP 
ACGGTTTCG7GGTGCACCCCGACCTGC7CGACGCGGTC7TCTCCGCGGTC 33 50 
DGFVVHPDLLDAVFSAV 
GGCGACGGAAGCCGCCAGCCGGCCGGATGGCGCGACCTGACGGTGCACGC 34 00 
35 GDGSRQPAGWRDLTVHA 

GTCGGACGCCACCGTACTGCGCGCCTGCCTCACCCGGCGCACCGACGGAG 34 50 

S DATVLRACLTRRTDG 
CCATGGGATTCGCCGCCTTCGACGGCGCCGGCCTGCCGGTACTCACCGCG 3500 
AMG FAAFDGAGL PVLTA 
40 GAGGC3GTGACGCTGCGGGAGGTGGCGTCACCGTCCGGCTCCGAGGAGTC 3550 
EAVTLREVASPSGSEES 
GGACGGCCTGCACCGGTTGGAGTGGCTCGCGGTCGCCGAGGCGGTCTACG 3 600 

DGLHRLEWLAVAEAVY 
ACGG7GACCTGCCCGAGGGACA7GTGC7GATCACCGCCGCCCACCCCGAC 3 650 
45 DGDLPEGHVLITAAKPD 

GACCCCGAGGACATACCCACCCGCGCCCACACCCGCGCCACCCGCGTCCT 37 00 

DPEDI PTRAHTRATRVL 
GACCGCCCTGCAACACCACCTCACCACCACCGACCACACCCTCATCGTCC 37 50 
TALQHHLTTTDHTLIV 
50 ACACCACCACCGACCCCGCCGGCGCCACCGTCACCGGCCTCACCCGCACC 3800 
HTTTDPAGATVTGLTRT 
GCCCAGAACGAACACCCCCACCGCATCCGCCTCATCGAAACCGACCACCC 3850 

AQNEKPHRI RLI ETDHP 
CCACACCCCCC7CCCCCTGGCCCAACTCGCCACCCTCGACCACCCCCACC 3 900 
55 HTPLPLAQLATLDHPH 

TCCGCCTCACCCACCACACCCTCCACCACCCCCACCTCACCCCCCTCCAC 3 950 
LRLTHHTLHHPHLTPLH 
ACCACCACCCCACCCACCACCACCCCCCTCAACCCCGAACACGCCATCAT 4 000 
TTTPPTTTPLN'PEHAII 
60 CATCACCGGCGGCTCCGGCACCCTCGCCGGCATCCTCGCCCGCCACCTGA 4 050 
ITGGSGTLAGI LARHL 
ACCACCCCCACACCTACCTCCTCTCCCGCACCCCACCCCCCGACGCCACC 4100 
NHPHTYLLSRTPPPDAT 
CCCGGCACCCACCTCCCCTGCGACGTCGGCGACCCCCACCAACTCGCCAC 4150 
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PGTHLPCDV'G D F H Q L A T 
CACCCTCACCCACA7CCCCCAACCCC7CACCGCCA7C77CCACACCGCCG 4 2 00 

T . L T H I P Q P L T A I F H 7 A 
CCACCC7CGACGACGGCATCC7CCACGCCC7CACCCCCGACCGCCTCACC 4 2 50 
ATLCCGILHA77PDRLT 
A C CG TC C 7CC AC CCCAAAGCCAACGCCGCC7GGCACCTGC AC CACC CCA C 4 300 

TV A H ? K A A A W H L H H L T 

CCAAAACCAACCCC7CACCCAC7TCGTCC7C7AC7CCAGCGCCGCCGCCG 4 3 50 

Q N Q P 7 T H F V L Y S 3 A A A 
7 C C 7 C G G C A G C C C C G G A C AA G G A A A C T A C G C C G C C G C C AA CGCCTTCCTC 4 4 00 
VLGS P G Q G N Y A A A N A F L 
CACGCCC7CGCCACCCACCGCCACACCC7CGGCCAACCCGCCACCTCCAT 4 4 50 

DALATHRHTLGQPATSI 
CGCC7GGGGCATGTCGCACACCACCAGCACCCTCACCGGACAACTCGACG 4 5 00 

A W G X W H T T S 7 L T G 7 L D 
ACGCCGACCGGGACCGCA7CCGCCGCGGCGGTT7CC7CCCGA7C. : lCGGAC 4 5 50 
D A G R D R - R R GRGPITG 

AC G ri'o- 'o ■„• J G GG G A.T G GAT 

D E G 

The NheI-XhoI restriction fragment that encodes module 8 of the FK-520 PKS 
with the endogenous AT domain replaced by the AT domain of module 12 (specific for 
malonyl CoA) of the rapamycin PKS has the DNA sequence and encodes the amino acid 
sequence shown below. 

AGATCTGGCAGCTCGCCGAAGCGCTGCTGACGCTCGTCCGGGAGAGCACC 50 

QLAEALLTLVREST 
GCCGCCG7GCTCGGCCACGTGGGTGGCGAGGACATCCCCGCGACGGCGGC 100 

AAVLGHVGGEDX P A T A A 
GTTCAAGGACCTCGGCATCGACTCGCTCACCGCGGTCCAGCTGCGCAACG 150 

FKOLGI DSLT AVQLRN 
C C C T C AC C G AG G CG AC C G G T G T G C GG C 7 G AAC G CC AC GGCGCTC T TCG AC 200 
ALTEATGVRL N A 7 A V F D 
7 T C C C G A C C CCGCACGTGCTCGCCG G G FJ\ G C 7 C G G C G A C G AA CTGACCGG 2 50 

EPTPHVLAGKLGDELTG 
CACCCGCGCGCCCG7CGTGCCCCGGACCGCGGCCACGGCCGGTGCGCACG 300 

T R A P V V P R T A A 7 A G A H 
ACGAoC wG .^7GGCGATCGTGGGAA7GGCC J'GCCvjG'^TGCCCvjG^'oGGG i C ^50 
DEPLAIVGXACRLPGGV 
GCGTCACCCGAGGAGCTGTGGCACCTCGTGGCATCCGGCACCGACGCCAT 4 00 

ASPEELWKLVASGTDAI 
C AC G G AG T 7 G C CG AC GG ACC G CG G C T GG G AC G 7 C G AC G C G AT C T ACG AC C 4 50 

T E F P ? D R G W D V D A I YD 
CGGACCCCGACGCGATCGGCAAGACCTTCG7CCGGCACGGTGGC7TCCTC 500 
PDPDAI GKTFVRHGGFL 
ACCGGCGCGACAGGCTTCGACGCGGCGTTCTTCGGCATCAGCCCGCGCGA 55 0 

TGATGFDAAFFGI S PRE 
GGCCCTCGCGATGGACCCGCAGCAGCGGGTGCTCCTGGAGACGTCGTGGG 600 

ALAMDPQQRVLLETSW 
AGGCGTTCGAAAGCGCCGGCATCACCCCGGACTCGACCCGCGGCAGCGAC 650 
EAFESAG ITPDSTRGSD 
ACCGGCGTGT7CGTCGGCGCC77C7CC7ACGGTTACGGCACCGGTGCGGA 70 0 

7 G V F V G A F S Y G Y G 7 G A D 
CACCGACGGCTTCGGCGCGACCGGCTCGCAGACCAGTGTGCTCTCCGGCC 75 0 

TDGFGATGSQ7SVLSG 
GGCTGTCGTACTTCTACGGTCTGGAGGGTCCGGCGGTCACGGTCGACACG 800 
RLSYFYGLRGPAVTVDT 
GCGTGTTCGTCGTCGCTGGTGGCGCTGCACCAGGCCGGGCAGTCGCTGCG 850 

ACSSSLVALHQAGQSLR 
CTCCGGCGAWGC7CGCTCGCCCTGGTCGGCGGCGTCACGGTGATGGCGT 900 

SGECSLALVGGVTVMA 
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CTCCCGGCGGCTTCGTGGAGTTCTCCCGGCAGCGCGGCCTCGCGCCGGAC 950 
S ? G G F V E F S R Q R G L A P D 
G G C C G G G C G A A G G G G T T C G G G G G G G G T G G G G A C G G G A. C G A G C T T C G G C G A 1 0 C 0 
G R A KAFGAGADGTS F A E 
5 GGC7GCGGGTGTGCTGATCGTCGAGAGG3TCTCCGAGGCCGAACGCAAGG 10 50 
G A G V I, I V £ R L S D A E R N 
GTCACAGCGTGGTGGGGGTGGTCGGTGGTTCG3CGGTCAACCAGGATGG7 1100 
G H T V L A V V R G S A V N Q D G 
GCGTGGAACGCGGTGTCGGCGCCGAACGGGGGGTCGCAGGAGCGGGTGAT 1150 
10 A S K G L S A ? N G P S 0 E R V I 

^GGG-^ A.GG GGC T GGCCTAACGCCGGGCTC ACCCCGGCGGA.CGTGGACGCC ^ -i c 0 0 

R Q A L A N A G L T P A D V D A 
TCGAGGCCCACGGCACCGGCACCAGGCTGGGCGACCCCATCGAGGCACAG 12 50 
VEAHGTGTRLGDPIEAQ 
15 j^jG . .nCTouCCA.GCT AGuG/iCA.GGAGGGCGC ~ A^Cl-CCGTGCTGCTGo'o 1 j00 
A V L A T Y G Q E R A T ? L L L G 

:::z"\r\ • \ - agatcggccagg::ccag3gcgcgtgcgggg t g3G~g 1350 

- . r\ c N ^ G r 1 A. Q — A, S ^ v 
G3A3 GA.TCA^AGATGGTGCAGGCGGT ZCGGGhZGGGGAGCZGCCGCCGACG 1 4 00 
20 G I I K M V Q A L R H G E L P P T 

CTGCACGCCGACGAGCCGTCGCCGCACGTCGACTGGACGGCCGGCGCCGT 14 50 

LHADEPSPHVDWTAGA V 
CGAACTGCTGACGTCGGCCCGGCCG7GGCCCGAGACCGACCGGCCACGGC 15 00 
ELLTSARPWPETDRPR 
25 GTGCCGCCGTCTCCTCGTTCGGGGTGAGCGGCACCAACGCCCACGTCATC 15 50 
R A A V S S T G V S G T N A H V I 
CTGGAGGCCGGACCGGTAACGGAGACGCCCGCGGCATCGCC7TCCGGTGA 1600 

L E A C PVTETPAAS PSGD 
CCTTCGOCTGCTGGTGTCGGCACGCTCACCGGAAGCGCTCGACGAGCAGA 1650 
30 LP LLVSARSPEALDEQ 

TCCGCCG ACTGCGCGCCTACCTGGACACCACCCCGGACGTCGACCGGGTG 1 7 C 0 
I RR LRAY LDTTPDV DRV 
GCCGTGGCACAGACGCTGGCCCGGCGCACACACTTCGCCCACCGCGCCG? 17 50 
AVAQTLARRTH FAH RAV 
35 GCTGCTCGGTGACACCGTCATCACCACACCCCCCGCGGACCGGCCCGACG 18 00 
LLGDTVITTPPADRPD 
AACTCGTCTTCGTCTACTCCGGCCAGGGCACCCAGCATCCCGCGATGGGC 18 50 
ELV FVYSGQG7QHPAMG 
GAGC^GCTkGCCGGCGCGTTCCCCGTCTTCGCGCGGATCChTCAGCAGGT 1 9C0 
40 E Q L A A A F ? V F A R I H Q Q V 

GTGGGACCTGCTCGATGTGCCCGATCTGGAGGTGAACGAGACCGGTTACG 19 50 

WDLLDVPDLEVNETGY 
CCCAGCCGGCCCTGTTCGCAJVTGCAGGTGGCTCTGTTCGGGCTGCTGGAA 2000 
AQPALFAMQVALFG LLE 
45 TCGTGGGGTGTACGACCGGACGCGGTGATCGGCCATTCGGTGGGTGAGCT 20 50 
5 W G V R ? D A V I G H S V G E L 
TGCGGCTGCGTATGTGTCCGGGGTGTGGTCGTTGGAGGATGCCTGCACTT 2100 

AAAYVSGVWSLE DACT 
TGGTGTCGGCGCGGGCTCGTCTGATGCAGGC:3.rGCCC-GGGGTGGGGTG 2150 
50 LVSARARLMQALPAGGV 

ATGGTCGCTGTCCCGGTCTCGGAGGATGAGGCCCGGGCCGTGCTGGGTGA 22 00 

MVAVPVSEDEARAVLGE 
GGGTGTGGAGATCGCCGCGGTCAACGGCCCGTCGTCGGTGGTTCTCTCCG 22 50 
GVEIAAVNGPSSVVLS 
55 GTGATGAGGCCGCCGTGCTGCAGGCCGCGGAGGGGCTGGGGAAGTGGACG 2 3 00 
GDEAAVLQAAEGLG K W T 
CGGCTGGCGACCAGGCACGCGTTCC ATTCCGCCCGTAT GGA^ACCCATGCT 2 3 5 0 

RLATSHAFHSARMEPML 
GGAGGAGTTCCGGGCGGTCGCCGAAGGCCTGACCTACCGGACGCCGCAGG 2 4 00 
60 EE FRAVAEGLTYRTPQ 

TCTCC ATGGCCGTTGGTGATCAGGTGACC ACCGCTG AGT ACTGGGTGCGG 2 4 5 0 
VSMAVGDQVTTAEYWVR 
CAGGTCCGGGACACGGTCCGGTTCGGCGAGCAGGTGGCCTCGTACGAGGA 2500 
QVRDTVRFGEQVAS YED 
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CGCCGTGTTCGTCGAGCTGGG7GCCGA , wCGoTCA.CTGGCCCGCCTGGTCG 2 5 5 0 

A V F V E L G A 0 P. S L A R L V 
ACGGTGTGG C G A T G C T G C A C G G C G A C C A G G AAA. T G C A G G C C G C G A. 7 G \ j G C 2 600 
D G V A M L H G D H E I Q A A I G 
5 G G G C 7 G G G C G A G C 7 G 7 A T G 7 C AA G G G G G T G A C G G 7 C G A C 7 G G C C C G C G G T 2 65 0 
A L A K L Y V \\ G V T V D W P A L 
CC7GGGCGATGC r "CCGGCAAGACGGG7GCTGGACCTTCCGACATACGGC7 2 7 00 

G G DAP A 7 R V L D L P T Y A 
7 C G AG G AG C AG G G G 7 AC TG G G 7 C G AG 7 G G G G AG G C C G G GC C G C AT C GG AG 2 7 50 
10 F Q [1 Q R Y W L E S A R P A A S D 

GCGGGCCACCCCGTGC7GGGG7GCGG7ATCGCCC7CGCCGGGTCGCCGGG 2 8 00 

AG H PVLG3GIALAGSF G 
CCGGG7GT7GACGGGT7CCG7GCGGAGCGGTGGGGAGGGCGGGGTGTTGG 28 50 
RVFTGSVPTGADRAVF 
1 5 TCGC CGAGCT GGGG CTGGCGGC GGCGG ACGCGG 7GG AGTG Co CC A.CG <jT v.. 2900 
7 A E L A L A A A D A V D C A T V 
" 7G G G T GG AG ATC G ,w " , ""Tjw_-jo GGG " G G GGGCA.TGGG ~ o ; j AG 2 9 50 
E R 7 G I A S PGRPGHGR7 
GACGG7AC AG AGG7GGG7CG AGGAGCCGGCGGACGACoG , -_ .CGb'^G^Co ^ . .'-0 00 
20 7 V Q 7 W V D E P A 0 D G R R R 

TCACCGTGCAGACCCGCACCGGCGACGCCCCGTGGACGCTGCACGCCGAG 3 050 
FTVHTRTGDAPWTLHAE 
GGGGTGCTGCGCCCCCATGGCACGGCCCTGCCCGATGCGGCCGACGCCGA 3100 
GVLRPHGTALPDAADAE 
25 GTCGCCCCCACCGGGCGCGG7GCCCGCGGACGGGCTGCCGGGTGTGTGGC 3150 
W P PPGAVPADGLPG V W 
GCCGGGGGGA.CCAGG7CTTCGCGGAGGCCGAGGTGGACGGACCGGACGGT 320 0 
RRGDQVFAEAEVDGPDG 
TTCGTGGTGCACCCCGACCTGCTCGACGCGGTCTTCTCCGCGGTCGGCGA 32 50 
30 F V V H P D L 7 D A V F S A V G D 

CGGAAGCCGCCAGCCGGCCGGATGGCGCGACCTGACGGTGCACGCGTCGG 3 300 

G3 RQPAGWRDLTVHAS 
ACGCCACCGTACTGCGCGCCTGCCTCACCCGGCGCACCGACGGAGCCATG 3 350 
DATVLRACLTRRTDGAM 
35 GGATTCGCCGCCTTCGACGGCGCCGGCCTGCCGGTACTCACCGCGGAGGC 34 00 
G FAAFDGAGLPVLTAEA 
GGTGACGCTGCGGGAGGTGGCG7CACCGTCCGGG7CCGAGGAGTCGGACG 34 50 

V T L REV ASPS GSEESD 
GCCTGCACCGGTTGGAGTGGCTCGCGGTCGCCGAGGGGGTCTACGACGGT 3 5C0 
40 G L H R 7 E W L A V A E A V Y 0 G 

GACCTGCCCGAGGGACATGTCC7GA7CACCGCCGCCCACCCCGACGACCC 3 55 0 

DLPEGHVLITAAH PCDP 
CGAGGACATACCCACCCGCGCCCACACCCGCGCCACCCGCGTCCTGACCG 3 600 
EDI PTRAHTRATRVLT 
45 CCCTGCAACACCACCTCACCACCACCGACCACACCCTCATCGTCCACACC 3650 
ALQHHLTTTDHTLIVHT 
ACCACCGACCCGGGCGGCGCCACGGTGACCGGCCTCACCCGCACCGCCGA 37 00 

TT DPAGATVTGLT RTAQ 
GAACGJ\ACACCCCCACCGCATCCGCCTCATCGAAACCGACCACCCCCACA 37 50 
50 NEKPHRIRLIETDHPH 

CCCCCCTCCCCCTGGCCCAACTCGCCACCCTCGACCACCCCCACCTCCGC 3 8 00 
TPLPLAQLATLDHPHLR 
CTCACCCACCACACCCTCCACCAGCCCCACCTCACCCCCCTCCACACCAC 38 50 
LTHHTLHKPHLTPLHTT 
55 CACCCCACCCACCACCACCCCCCTCAACCCCGAACACGCCATCATCATCA 3900 
7PPTTTPLNPEKAI II 
CCGGCGGCTCGGGCACCCTCGCCGGCATCCTCGCCCGCCACCTGAACCAC 3 950 
TGGSGT L A G I LARHLNH 
CCCCACACCTACCTCCTCTCCCGGACCCCACCCCCCGACGCCACCCCCGG 4 000 
60 PHTYLLSRTPPPDATPG 

CACCCACCTCCCCTGCGACGTCGGCGACCCCCACCAACTCGCCACCACCC 4 050 

THLPCDVGDPHQLATT 
TCACCCACATCGCCCAACCCCTCACCGCCATCTTCCACACCGCCGCCACC 4100 
LTH I PQPLTAI FHTAAT 
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CTCGACGACGGCATCCTCCACCCCCTCACCCCCGAGCGGCTCACCACCGT 4 150 

LDDGILHALTPDRLTTV 
CCTGGACCCCAAAGCCAAGGCGGGC?GGGAGG7GGAGCACG':GACCCAAA 4 200 

L H P K A N A A V. 1 H L H H L T Q 
ACCAACGCCTCACCCAGTTGGTCCTGTACTCCAGGGGG3GCGCGG7CCTG 4 25 0 
N Q P L T H F V L Y S S A A A V L 
GGCAGCCGCGGAGAAGGAAACTAGGGCGGGGGCAACGCGTTGCTGGAGGC 4 300 

G 3 F G Q G U Y A A A N A F L D A 
CGC;, o JCA.GCCACCGCCACACCCTCGGCCAJ\CCCGC CACCTC GAToGCCC 4 3 50 

L A T H R H T L G Q F A T S I A 
GGGGCATGTGGCACACGACCAGCACCCTCACCGGACAAeTGGACGAGGC: 4 4 00 
W G M W HTTSTLTGQLDDA 
G A.CCGGG ACC GC AT CCGC C GCGG CGGTTTCCTCCGGA.7C A GGG A GG AC G A 4 4 50 

D R D R I R R G G F L P I T D D E 
G G G C AT G G G G AT G C AT 

G 
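
As an illustrative check only, and not part of the original disclosure, the reading frame of the listing above can be confirmed by translating its first DNA line and comparing the result with the first amino acid line. The sketch below assumes the standard genetic code; the function name `translate` and the frame offset of 8 nucleotides are assumptions, the latter inferred by aligning the two lines.

```python
# Standard genetic code, indexed by codon with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna, offset=0):
    """Translate dna starting at offset; a trailing partial codon is ignored.

    Codons containing anything other than A, C, G, T are reported as 'X'.
    """
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(offset, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 50 nucleotides of the module 8 cassette listing (begins with AGATCT).
first_line = "AGATCTGGCAGCTCGCCGAAGCGCTGCTGACGCTCGTCCGGGAGAGCACC"
print(translate(first_line, offset=8))  # QLAEALLTLVREST
```

The same function can be applied to later lines of a listing to flag positions where character-recognition damage has broken the correspondence between the DNA and amino acid rows.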

The NheI-XhoI restriction fragment that encodes module 8 of the FK-520 PKS 
with the endogenous AT domain replaced by the AT domain of module 13 (specific for 
methylmalonyl CoA) of the rapamycin PKS has the DNA sequence and encodes the 
amino acid sequence shown below. 

AGATCTGGCAGCTCGCCGAAGCGCTGCTGACGCTCGTCCGGGAGAGCACC 50 

QLAEALLTLVP, EST 
GoCG woToCTCGG^CrtCGTGGoT'o^CGAGGACA-TCC'^CGCG.HC'oO'^^o^ 10 0 

A A V E G H V G G E 0 I ? A T A A 
GTTCA.AGGACCTCGGCATCGACTGGCTCACCGCGGT CCA.GCTGCGCAAC^ ^50 

FKDLGI D S L T A V Q L R N 
CCCTCACCGAGGCGACCGGTGTGCGGCTGAACGCCACGGCGGTCTTCGAC 2 00 
ALT EATGVRLNATAVFD 
TTCCCGACCCCGCACGTGCTCGCCGGGAAGCTCGGCGACGAACTGACCGG 2 50 

FPT PHVLAGKLGDELTG 
CACCCGCGCGCCCGTCGTGCCCCGGACCGCGGCCACGGCCGGTGCGCACG 3 00 

TRA PVVPRTAATAGAH 
ACG.nGCGGCTGGCGATCGTGGG AJ-.T G G C C 0. G C Co G L T Co G GC GG G G T C j 5 0 
E E P L A I V G M A C R E ? G G V 
GCGTC ACCCG AGG AGCTGT GGC ACC T GGTGGCATCC GGCACCGACGCCA.T 4 00 

A S P E E L W H L V A S G T D A I 
CACGGnGTTCCCGACGGACCGCGGCTGGGA-CGTCGACGCGA.TCTA.CGA.CC 4 50 

TEFPTDRGWDVDAIYD 
CGGACCCCGACGCGATCGGCAAGACCTTCGTCCGGCACGGTGGCTTCCTC 5 00 
PDPDAIGKT FVRHGGFL 
ACCGGCGCGACAGGCTTCGACGCGGCGTTCTTCGGCATCAGCCCGCGCGA 5 50 

TGATGFDAAFFGISPRE 
G GC CCTCGCG AT GG AC CCGCAGC AG CGGGTGCTCCTGGAGACGTCGTGGG 600 

ALAMDPQQRVLLETSW 
AGGCGTTCGAAJ^GCGCCGGCATCACCCCGGACTCGACCCGCGGCAGCGAC 65 0 
EAFESAGIT PDSTRGSD 
ACCGGCGTGTTCGTCGGCGCCTTCTCCTACGGTTACGGCACCGGTGCGGA 7 00 

TGV FVGAFS YGYGTGAD 
CACCGACGGCTTCGGCGCGACCGGCTCGCAGACCAGTGTGCTCTCCGGCC 7 50 

TDG FGATGSQTSVLSG 
GGCTGTCGTACTTCTACGGTCTGGAGGGTCCGGCGGiCACGGTCGACACG 8 00 
RLSYFYGLEGPAVTVDT 
GCGTGTTCGTCGTCGCTGGTGGCGCTGCACCAGGCCGGGCAGTCGCTGCG 8 50 

ACS SSLVALHQAGQSLR 
CTCCGGCGAATGCTCGCTCGCCCTGGTCGGCGGCGTCACGGTGATGGCGT 900 

SGECST. ALVGGVTVMA 
CTCCCGGCGGCTTCGTGGAGTTCTCCCGGCAGCGCGGCCTCGCGCCGGAC 950 
SPGGFVEFSRQRGLAPD 
GGCCGGGCGAAGGCGTTCGGCGCGGGTGCGGACGGCACGAGCTTCGCCGA 1000 
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grakafgagadgtsfae 
gggtgccggtg7gctgatcgtcgagaggc7ctcc3acgccgaacgcaacg 10 50 

g . a g v l iverl3daern 
gtcacaccgtcctggcggtcgtccgtggttcggcggtcaaccaggatggt 1100 
5 g h t v l a v 7 r g s a v n q d g 

G'^CTCCAj^CGGGCTGTCGGCGCCGAACGGGCGGTCGCA.GGA.GC'jGGTGA.T 1 1 50 

ASNGLSAPNGPSQERVI 
CCGGCAGGCCCTGGCCAACGCCGGGCTCACCCCGGCGGACGTGGACGCCG 12 00 
RQALANAGLTPADVDA 
10 T C G AG G C C C AC G G C AC C G G C AGG AG GC ?G G GGG AG CC CAT C G AGG C AC AG 12 50 
VEAHGTGTRLGDPI EAQ 
GCGGTACTGGCCACCTACGGACAGGAGCGCGCCACCCCCCTGCTGCTGGG 13 00 

AVLATYGQERATPLLLG 
CTCGCTGAAGTCCAACATCGGCCACGCCCAGGCCGCGTCCGGCGTCGCCG 13 50 
15 SLKSNIGHAQAASGVA 

GCATCATCAAGATGGTGCAGGCCCTCCGGCACGGGGAGCTGCCGCCGACG 14 00 
G I I K M V Q " L R H G E L 1' ? T 
CTGCACGCCGACGAGCCGTCGCCGCACGTCGACTGGACGGCCGGCGCCGT I A 50 
G H A D E P S P H V 0 W T A G A V 
20 _ o AACT ^jCTGA.CGTCGGCC CGGCCGTGGC CCGAGACCG ACCGGCCACGGG 1^00 
ELLTSARPWPE7DRPR 
GTGCCGCCG7CTCCTCGTTCGGGGTGAGCGGCACCAACGCCCACGTCATC 15 50 
RAAVSS FGVSGTNAHVI 
CTGGAGGCCGGACCGGTAACGGAGACGCCCGCGGCATCGCCTTCCGGTGA 15 00 
25 GEAGPVTETPAAo PSGD 

CCTTCCCCTGCTGGTGTCGGCACGCTCACCGGAAGCGCTCGACGAGCAGA 1650 

LPLLVSARSPEALDEQ 
TCCGCCGACTGCGCGCCTACCTGGACACCACCCCGGACGTCGACCGGGTG 17 00 
I RRLRAYLD77PDVDRV 
30 GCCG7GGCACAGACGC7GGCCCGGCGCACACACTTCGCCCACCGCGCCG7 17 50 
AVAQTLARRTHFAHRAV 
GCTGCTCGGTGACACCGTCATCACCACA.CCCCCCGCGGACCGGCCCGACG 18 00 

LLGDTVI TTPPADRPD 
AACTCGTCTTCGTCTACTCCGGCCAGGGCACCCAGCATCCCGCGATGGGC 18 50 
35 ELVFVYSGQGTQH PAMG 

GAGCAGCTAGCCGA7TCG7CGG7GG7G77CGCCGAGCGGATGGCCGAG7G 1900 

EQLADSSVVFAERMAEC 
7GCGGCGGCG77GCGCGAG77CG7GGAC7GGGA7C7G77CACGG77C7GG 1950 
AAALREFVDWDLFTVL 
40 A7GA7CCGGCGG7GG7GGACCGGG77GA7G7GG7CCAGCCCGC77CC7GG 2 0 00 
DDPAVVDRVDVVQPASW 
GCGATGATGGTTTCCCTGGCCGCGG7GTGGCAGGCGGCCGGTGTGCGGCC 2 0 50 

AMMVS LAAVWQAAGVRP 
GGATGCGGTGATCGGCCATTCGCAGGGTGAGATCGCCGCAGCTTGTGTGG 2100 
45 DAVIGHSQGEIAAA.CV 

CGGGTGCGGTGTCACTACGCGATGCCGCCCGGATCGTGACCTTGCGCAGC 2150 
AGAVSLRDAARIVT LRS 
CAGGCGATCGCCCGGGGCCTGGCGGGCCGGGGCGCGATGGCATCCGTCGC 2 2 00 
QA IARG LAGRGAMASVA 
50 CCTGCCCGCGCAGGATGTCGAGCTGGTCGACGGGGCCTGGATCGCCGCCC 2 2 50 
LPAQDVELVDGAW I A A 
ACAACGGGCCCGCCTCCACCGTGATCGCGGGCACCCCGGAAGCGGTCGAC 2 300 
HNGPAS TVIAGTPEAVD 
CATGTCCTCACCGCTCATGAGGCACAAGGGGTGCGGGTGCGGCGGATCAC 2 350 
55 HVLTAH EAQGVRVRRIT 

CGTCGACTATGCCTCGCACACCCCGCACGTCGAGCTGATCCGCGACGAAC 24 00 

VDYASHTPHVELI RDE 
TACTCGACATCACTAGCGACAGCAGCTCGCAGACCCCGCTCGTGCCGTGG 2 4 50 
LLDITSDSSSQTPLVPW 
60 CTGTCGACCGTGGACGGCACCTGGGTCGACAGCCCGCTGGACGGGGAGTA 2500 
LSTVDGTWVDSPLDGEY 
C7GGTACCGGAACCTGCGTGAACCGG7CGG777CCACCCCGCCGTCAGCC 2550 

WYRNLREPVGFHPAVS 
AGTTGCAGGCCCAGGGCGACACCGTGTTCGTCGAGGTCAGCGCCAGCCCG 2 600 
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QLQAQGDT V F V E v S A S P 

G 7 G T 7 G 7 TG C AG G C G AT G G AC G AC G A 7 G T C i o 7 C AC G G 7 7 G C C A.C GCTGCG 2650 

7 7 L Q A M D 7 7 7 V T 7 A 7 L R 
7 C G T G A C G AC G G C G ACG C C A C C C G G A7 G C T C AC 3 3 C C C 7 G G CAC AG G C C 7 2 7 00 

R D D G D A T R M L 7 A h A Q A 
A7 GT C C ACGG C G T C ACC GT CG ACTG CC CC G C C A.T C C 7C CG C AC C AC CAC A 2 7 50 
Y V H G V T 7 D W P A I L G T T 7 
ACCCGGGTACTGGACCTTCCGACCTACGCGTTCCAACACCAGCGGTACTG 2 8 00 

7 R V L D L P T Y A F Q H 0 R Y W 
GCTCGAGTCGGCACGCCCGGCCGCA7CCGACGCGGGCCACCCCGTGCTGG 2 8 50 

L E S A R P A A S D A G H P V L 
GCTCCGG7ATCGCCCTCGCCGGG7CGCCGGGCCGGGTGT7CACGGGTTCC 2 900 
G5GIALAGSPGRVFTGS 
G 7 GC CG ACCG GT GCGGACCGCGCGGTGT7CG 7 CGCCG AG CTGGCGCTGGC 2 9 50 

7 F T G A D R A V F V A E L A L A 
CGGCGCGGACGCGGTCGACTGCGCCACGGTCGAGCGGC7CGACATCGCCT 30C0 

A A D A V D C A m V E R L D I A 
CC oTG ^CCGGCCGGCCGG GCC A7GGCC . TACGACC ^.-CACA^GA.CCTGG o1 C 3 050 
S V P R P G K G R 7 7 V Q T W V 

G ACG AG C CGG CG G AC G ACG G C CG G C G CC GG 7 T C AC C G 7 G CAC AC C CGC AC 3100 

3EFADDGRRRFTVHTR T 
CoGCo ACtjcCCCGTGGAovjjC i GC—.^^CCG.^ooG-.vG - io^T^C^-i^c^CCA^T'o 3 i50 

G D A P W 7 L H A E G 7 E R P H 
GCACGGCCCTGCCCGATGCGGCCGACGCCGAGTGGCCCCCACCGGGCGCG 32 30 
GTALPDAADAEWPPPGA 
GTGCCCGCGGACGGGCTGCCGGGTG7GTGGCGCCGGGGGGACCAGG7C7T 3 2 50 

V P A D G L P G V W R R G D Q V F 
CGCCGAGGCCGAGGTGGACGGACCGGACGGTTTCGTGGTGCACCCCGACC 3 300 

AEAEVDGPDGFVVHPD 
TGCTCGACGCGGTCTTCTCCGCGGTCGGCGACGGAAGCCGCCAGCCGGCC 3350 
LEDAV FSAVGDGSRQPA 
GGATGGCGCGACCTGACGGTGCACGCGTCGGACGCCACCGTACTGCGCGC 34 00 

GWRDLTVHASDATVLRA 
CTGCCTCACCCGGCGCACCGACGGAGCCATGGGATTCGCCGCCTTCGACG 34 50 

CLTRRTDGAMGFAAFD 
GCGCCGGCCTGCCGGTACTCACCGCGGAGGCGGTGACGCTGCGGGAGGTG 3 500 
GAGLPVLTAEAVTLREV 
GCGTCACCGTCCGGCTCCGAGGAGTCGGACGGCCTGCACCGGTTGGAGTG 35 50 

ASPSGSEESDGLKRLE W 
GCTCGCGG7CGCCGAGGCGG7C7ACGACGG7GACC7GCCCGAGGGACATG 3 600 

L A V A E A V D G D 1 ? E G H 

TCCTGATCACCGCCGCCCACCCCGACGACCCCGAGGACATACCCACCCGC 3 65 0 
VLITAAHPDDPEDI PTR 
GCCCACACCCGCGCCACCCGCGTCC7GACCCCCCTGCAACACCACCTCAC 3700 

AHTRATRVLTALQH HLT 
CACCACCGACCACACCCTCATCGTCCACACCACCACCGACCCCGCCGGCG 37 50 

TTDHT7.IVHTTTDPAG 
CCACCGTCACCGGCCTCACCCGCACCGCCCAGAACGAACACCCCCACCGC 3 8 00 
ATVTGtTRTAQNEHPHR 
ATCCGCCTCATCGAAACCGACCACCCCCACACCCCCCTCCCCCTGGCCCA 38 50 

I RLI ETDHPHTPLPLAQ 
ACTCGCCACCCTCGACCACCCCCACCTCCGCCTCACCCACCACACCCTCC 3900 

LATLDHPHLRLTH HTL 
ACCACCCCCACCTCACCCCCCTCCACACCACCACCCCACCCACCACCACC 3 950 
HHPHLTPLHTTTPPTTT 
CCCCTCAACCCCGAACACGCCATCATCATCACCGGCGGCTCCGGCACCCT 4 000 

PLNPEHAIIITGGSGTL 
CGCCGGCATCCTCGCCCGCCACCTGAACCACCCCCACACCTACCTCCTCT 4 050 

A G I L A R H L N H P K T Y L L 
CCCGCACCCCACCCCCCGACGCCACCCCCGGCACCCACCTCCCCTGCGAC 4 100 
SRTPPPDATPGTKLPCD 
GTCGGCGACCCCCACCAACTCGCCACCACCCTCACCCACATCCCCCAACC 4150 

VGDPHQLATTLTKIPQP 
CCTCACCGCCATCTTCCACACCGCCGCCACCCTCGACGACGGCATCCTCC 4 200 
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v g y s s a a a v g g s p g g g 
agtacggggccgcgaaggcgttcctcgacggggtggggaccgaggggga: .; 4 00 






r A A A N A F L D A A A T G K H 
A G C C T C G G C C GG\ C C C G C C A C C T C C A T C G C G 7 G G G G C A ? G ? G G C A C A G C A 0 » 1 :>C 



G Q ? A T 3 0 A W G M W H 







Phage KC515 DNA was prepared using the procedure described in Genetic 
Manipulation of Streptomyces, A Laboratory Manual, edited by D. Hopwood et al. A 
phage suspension prepared from 10 plates (100 mm) of confluent plaques of KC515 on 
S. lividans TK24 generally gave about 3 µg of phage DNA. The DNA was ligated to 
circularize at the cos site, subsequently digested with restriction enzymes BamHI and 
PstI, and dephosphorylated with SAP. 

Each module 8 cassette described above was excised with restriction enzymes 
BglII and NsiI and ligated into the compatible BamHI and PstI sites of KC515 phage 
DNA prepared as described above. The ligation mixture containing KC515 and various 
cassettes was transfected into protoplasts of Streptomyces lividans TK24 using the 
procedure described in Genetic Manipulation of Streptomyces, A Laboratory Manual, 
edited by D. Hopwood et al., and overlaid with TK24 spores. After 16-24 hr, the plaques 
were restreaked on plates overlaid with TK24 spores. Single plaques were picked and 
resuspended in 200 µL of nutrient broth. Phage DNA was prepared by the boiling 
method (Hopwood et al., supra). PCR with primers spanning the left and right 
boundaries of the recombinant phage was used to verify that the correct phage had been 
isolated. In most cases, at least 80% of the plaques contained the expected insert. To 
confirm the presence of the resistance marker (thiostrepton), a spot test is used, as 
described in Lomovskaya et al. (1997), in which a plate with spots of phage is overlaid 
with a mixture of spores of TK24 and phiC31 TK24 lysogen. After overnight incubation, 
the plate is overlaid with antibiotic in soft agar. A working stock is made of all phage 
containing desired constructs. 
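
The compatibility of the BglII and NsiI cassette ends with the BamHI and PstI vector sites can be illustrated with a short sketch. It is offered under stated assumptions only: the recognition sites and cut positions are the generally known ones for these four enzymes, and the names `ENZYMES`, `overhang`, and `compatible` are hypothetical helpers, not part of the described procedure.

```python
# Recognition site and number of top-strand bases 5' of the cut.
# All four sites are palindromic, so the bottom-strand cut mirrors the top.
ENZYMES = {
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
    "NsiI": ("ATGCAT", 5),   # ATGCA^T
    "PstI": ("CTGCAG", 5),   # CTGCA^G
}


def overhang(name):
    """Return (overhang type, single-stranded overhang sequence)."""
    site, cut = ENZYMES[name]
    mirror = len(site) - cut  # bottom-strand cut in top-strand coordinates
    lo, hi = sorted((cut, mirror))
    if cut < mirror:
        kind = "5'"
    elif cut > mirror:
        kind = "3'"
    else:
        kind = "blunt"
    return kind, site[lo:hi]


def compatible(a, b):
    """Two sticky ends can anneal if type and overhang sequence match.

    Equality is sufficient here because the overhangs of palindromic
    sites are self-complementary.
    """
    kind, _ = overhang(a)
    return kind != "blunt" and overhang(a) == overhang(b)


for pair in (("BglII", "BamHI"), ("NsiI", "PstI"), ("BglII", "PstI")):
    print(pair, overhang(pair[0]), overhang(pair[1]), compatible(*pair))
```

Because the two ends of the cassette carry different overhangs (a 5' GATC end and a 3' TGCA end), the insert can ligate into the doubly digested phage DNA in one orientation only.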



Streptomyces hygroscopicus ATCC 14891 (see US Patent No. 3,244,592, issued 
5 Apr 1966, incorporated herein by reference) mycelia were infected with the 
recombinant phage by mixing the spores and phage (1 x 10^8 of each), and incubating on 
R2YE agar (Genetic Manipulation of Streptomyces, A Laboratory Manual, edited by D. 
Hopwood et al.) at 30°C for 10 days. Recombinant clones were selected and plated on 
minimal medium containing thiostrepton (50 ng/ml) to select for the thiostrepton 
resistance-conferring gene. Primary thiostrepton-resistant clones were isolated and 
purified through a second round of single colony isolation, as necessary. To obtain 
thiostrepton-sensitive revertants that underwent a second recombination event to evict 
the phage genome, primary recombinants were propagated in liquid media for two to 
three days in the absence of thiostrepton and then spread on agar medium without 
thiostrepton to obtain spores. Spores were plated to obtain about 50 colonies per plate, 
and thiostrepton-sensitive colonies were identified by replica plating onto 
thiostrepton-containing agar medium. PCR was used to determine which of the 
thiostrepton-sensitive colonies reverted to the wild type (reversal of the initial 
integration event), and which contained the desired AT swap at module 8 in the 
ATCC 14891-derived cells. The PCR primers used amplified either the KS/AT junction 
or the AT/DH junction of the wild-type and the desired recombinant strains. 
Fermentation of the recombinant strains, followed by isolation of the metabolites and 
analysis by LCMS and NMR, is used to characterize the novel polyketide compounds. 
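
The logic of the junction PCR screen can be sketched as follows. This is a toy model under stated assumptions, not the primers or sequences actually used: the short templates, the primer sequences, and the helper names `revcomp` and `amplicon_length` are all hypothetical, and annealing is modeled as an exact string match.

```python
def revcomp(seq):
    """Reverse complement of an uppercase A/C/G/T sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon_length(template, fwd, rev):
    """Length of the product primed by fwd and rev, or None if either fails.

    fwd must match the top strand; rev must match the bottom strand
    downstream of fwd (its reverse complement appears in the top strand).
    """
    start = template.find(fwd)
    if start < 0:
        return None
    site = revcomp(rev)
    end = template.find(site, start + len(fwd))
    if end < 0:
        return None
    return end + len(site) - start


# Hypothetical toy sequences: a shared KS segment followed by either the
# wild-type AT segment or the replacement AT segment.
KS = "ATGGCCTGCCGT"
wild_type = KS + "CAGGGTGAGATC"
recombinant = KS + "GTGGGTGAGCTT"

fwd = "GGCCTGCC"                    # anneals in the shared KS segment
rev_wild = revcomp("GGTGAGATC")     # anneals only in the wild-type AT
rev_swap = revcomp("GGTGAGCTT")     # anneals only in the replacement AT

for name, template in (("wild type", wild_type), ("recombinant", recombinant)):
    print(name,
          amplicon_length(template, fwd, rev_wild),
          amplicon_length(template, fwd, rev_swap))
```

In this model a colony is scored as a revertant when only the wild-type primer pair yields a product, and as carrying the AT swap when only the replacement-specific pair does.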

Example 2 

Replacement of Methoxyl with Hydrogen or Methyl at C-13 of FK-506 

The present invention also provides the 13-desmethoxy derivatives of FK-506 
and the novel PKS enzymes that produce them. A variety of Streptomyces strains that 
produce FK-506 are known in the art, including S. tsukubaensis No. 9993 (FERM BP-927), 
described in U.S. Patent No. 5,624,852, incorporated herein by reference; S. 
hygroscopicus subsp. yakushimaensis No. 7238, described in U.S. Patent No. 4,894,366, 
incorporated herein by reference; S. sp. MA6858 (ATCC 55098), described in U.S. 
Patent No. 5,116,756, incorporated herein by reference; and S. sp. MA6548, described 
in Motamedi et al., 1998, "The biosynthetic gene cluster for the macrolactone ring of the 
immunosuppressant FK-506," Eur. J. Biochem. 256: 528-534, and Motamedi et al., 
1997, "Structural organization of a multifunctional polyketide synthase involved in the 
biosynthesis of the macrolide immunosuppressant FK-506," Eur. J. Biochem. 244: 74-80, 
each of which is incorporated herein by reference. 

The complete sequence of the FK-506 gene cluster from Streptomyces sp. 
MA6548 is known, and the sequences of the corresponding gene clusters from other 
FK-506-producing organisms are highly homologous thereto. The novel FK-506 
recombinant gene clusters of the present invention differ from the naturally occurring 
gene clusters in that the AT domain of module 8 of the naturally occurring PKSs is 
replaced by an AT domain specific for malonyl CoA or methylmalonyl CoA. These AT 
domain replacements are made at the DNA level, following the methodology described 
in Example 1. 

The naturally occurring module 8 sequence for the MA6548 strain is shown 
below, followed by the illustrative hybrid module 8 sequences for the MA6548 strains. 

GCA7GCGGCTGTACGAGG7GGCA7GGCG7AC7GGAAGTCC7GTG37;;G7G 5 0 
M R L Y E A A R R T G S P V 7 V 
GCGGCGGCGGTCGACGAGGCGCCGGACGTGG GGG7GCTG CGCGGG77 G7G 100 

AAALDDAPD V P L L R G 7 R 
GGGTA7GAGCGTGCGGGG7GGCGGCG7GGGGG AA C G C T C T C T C G 7 7 7 A C C 15 0 

R T T V R R A A V R 7 R S L A 7 
v_ :^TCG 7 GGTGCTGCCCGACGACGAGCGCGG7G.--.V ,-o 7 77 ~ 7 77 w ~ ^ - ~ ..^ 200 
RSPCCPTTSAP7PFSRS 
7 C C ? G G AA C A G C A C C G C C A. C G G T G C T 7 G G 7 7 A 7 7 7 G 7- G 7 G 7 7 G A A : A 7 A T 250 

3 W N S T A T V I, G K L G A E 7 I 
C C C G G C G A C G A C G A C G T 7 G AA G G AA 7 T 7 G G 7 A 7 C G A 7 7 7 G 7 T 7 A 7 7 3 7 G G 300 
P A 7 T T F K E L G I 7 7 L 7 A 

T r " P *\ " "" 1 <^ ^ TV ^ ^ T> ^ A r ^ /— ^ -> 7; ^ ^ .-~ r~ rr i^^^r.r'f"^ - • T ^ A 

1 v_ Lr.'J^ - oV- o^rLn ^oLvj^ - vjr\t- ^rt ^ o *j v, ^j.'-. v.. — v.: v: o . . . _ ^: ^ ^- » _ . , ... ^ _< o 

V Q L R N A L T 7 A T G V R 7 N A 
ACAGCGG7CTTCGAC7T7CCGACGCCG7GCG7G77CGCCGCGAGA7773G hCO 

T A V F D F P T P R A L A A R 7 G 
CG ACG AGCTGGCCGG 7 A\CC CGCGCGC 7 CGTCGCGGCCCGGACCG73G 77 A 4 50 

D E L A G T R A P V A A R T A A 
CCGCGGCCGCGCACGACGAACCGCTGGCGATCGTGGGCATGGCCTGCCGT 500 
TAAAH DEPLAIVGMACR 
CTGCCGGGCGGGGTCGCGTCGCCACAGGAGCTGTGGCGTCTCGTCGCGTC 550 

L P G G V A S P Q E I W R L V A S 
CGGCACCGACGCCA7CACGGAGTTCCCCGCGGACCGCGGCTGGGACGTGG 600 

G T D A I T E F P A D R G W 7 V 
ACGCGCTCTACGACCCGGACCCCGACGCGATCGGCAAGACCTTCGTCCGG 650 
DALY D P DPDAIGKTFVR 
CACGGCGGCTTCCTCGACGGTGCGACCGGCTTCGACGCGGCGTTCTTCGG 7 00 

HGGFLDGATGFDAAFFG 
G A 7 G A G C C 7 G C G C G AjG G C C C T G G C C A T G G A C 7 C G 7 A G C AA 7GGG7G77G7 7 50 

I 5PREALAMDPQQ R V 7 
TGGAGACG7CCTGGGAGGCGTTCG AAA. G 7 G G G G G 7 o T G A C 7 7 7 o .-.C^-u 8 00 
L E T S W E A F E S A G I T P 7 A 
GCGCG G GGC AGC G AC AC CGGCG7G7TC A7 CG GC G 7 G T 7 C 7 C77 A7 G 7 G 7 A 8 50 

ARGS DTGVFIGAFSYGY 
CGGCACGGGTGCGGATACCAA.CGGCTTCGGCGCGACAGGGTCGCAGACCA 900 

GTGADTNGFGATGSQT 
GCGTGCTCTCCGGCCGCC7CTCGTACTTCTACGGTC7GGAGGGCCCTTCG 950 
SVGSGRLSYFYGL^GPS 
GTCACGGTCGACACCGCCTGCTCGTCGTCACTGGTCGCCCTGCACCAGGC 1000 

VTVDTACSSSLVALHQA 
AGGGCAGTCCCTGCGCTCGGGCGAATGCTCGCTCGCCCTGGTCGGCGGTG 10 50 

GQSLRSGECSLALVGG 
TCACGGTGATGGCGTCGCCCGGCGGATTCGTCGAGTTCTCCCGGCAGCGC 1100 
VTVMAS PGGFVEFSRQR 
GGGCTCGCGCCGGACGGGCGGGCGAAGGCGTTCGGCGCGGGCGCGGACGG 1150 

GLAPDGRAKAFGAGADG 
TACGAGCTTCGCCGAGGGCGCCGGTGCCCTGGTGGTCGAGCGGCTCTCCG 1200 

TSFAEGAGALVVFRLS 
ACGCGGAGCGCCACGGCCACACCGTCCTCGCGC7CGTACGCGGCTCCGCG 1250 
DAERHGHTVLALVRGSA 
GCTAACTCCGACGGCGCGTCGAACGGTCTGTCGGCGGCGAACGGCCCCTC 1300 

ANSDGASNGLSAP NGPS 
CCAGGAACGCGTCATCCACCAGGCCCTCGCGAACGCGAAACTCACCCCCG 1350 
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QERVI HQALANAKLTP 
C C G ATG 7 CG ACG CGG T C G AG G C G C ACG G C AC CG G CACCCG CC T CGGCG AC 14 00 



D A V E A f: G T G 7 R L G D 



1 4 5 C 



5 PIE A Q A L C A G V G Q L; R A 7 

? L L L G S L K G K I G H A Q A 
C - u T ^. t -^G o G T o _ u o o *7 C .^*. T G ."--.G Tu^ i o . ^ ."*. >^/C A. - ~ G o C t ^ u ^ o 1 3 d 0 
A S G V A G I IK K V Q A I R H G 
10 GAACTGCCGCCGACAC7GCACGCGGACGAGCCGTCGCCGCACGTCGACTG 1600 

E L P F T L H A C E ? S ? H V D W 
G _ G G G o o C G C G 7 ^ G/-.«j C T C C7 o A. ^ j GCw o ^ ^^GGCCGTGo^^u'o j -.j.--. i d d0 

7AGA7ECETSARPWPG 
(J GGG7GG CCCGCGCCGCGCTGCCG7 CTCGTCG77CGGCo7GAGCGGC~.CG i 1 0 0 
15 T G R ? R R A A V 5 S F G V S G 7 

AA.CG ^ C C.-w^A CAT CGTGGG.G^CAGo AGGGo7G.^."Gl^GGGvjACCG'oTG'o.^. _ < 50 
N A H I I L E A G P V K T G P Y E 

A GATE A G ? V G V G P V E A 
20 GACCGC7CCCCGCGGCGCCGCCG7CAGCACCGGGCGAAGACC77CCGC7G 18 50 

G P 7 P A A ? ? S A P G E G G P G 
7 Co r'vjC^'jG^uGjl 7 - 'jGrGG-AC^oA 1 .. .j.-'.ooA^r. t vUviow'o ~ i ^00 
t '■ ! 3 z* h c p v p. " " ^ ^ "* G 

/■ - ,-1 (— ™ T m ,Tn ^ t '- - ^ — f ^ ~> /— — — <— .~« ^ - ^ i— ^ " ^ ' r* «i -* ^ i - < ^ 

...j ■„ v; . . 'w - ^. .j ^ OVJ'v ^ v- o o ^ : ^ O - ^ -j.-. ~ - O O ~ — w O i O ^. 'O ; j Jb 

25 R A Y G G 7 G ? G V 7 P A A 7 A 

Ajr.^ .-',C 7 o G C C C G G G G 7 - 1 * ^ -o w A G 7 'I G ^Cr.C o o o'w^.G i . m .C i uwTCGG ^000 
QTL^RRTHFTHR A V G L G 
GhGAGGG x 1 C^i7CGGGCuC7GCGCCGGCGGA J .CC.'-'.--'GGGGA.C^-A ; -iG7iwoTL .1 2 0b0 
G 7 V I G A P ? A D Q k D Z L V F 
30 CG7C7AC7CCGG7CAGGGCACCCAGCA7CCCGCGATGGGCGAGCAAC7CC 2100 

V Y SGQGTQH PAMGEQL 
CGGCCGCGT^CCCCGT^TTCGCCGkTGCCTGGCkZGACGCGCTCCG^CG^ 21 5C 
A A A F P V F A D A W H D A L R R 
C 7 C G AC G A C C C C G AC C C G C A C G AC C C C A C AC G G AG C C AG C AC ACGCTC77 22 00 
35 LDDPDPKDFTRSQHTLF 

CGCGCAGGAGGCGGGG77GACCGGCC7GC7GAGG7GC7GGGACATCACGG 2 2 5 0 

A H Q A A F T A LLPS W D I 7 
C GC ACG C C G T C A7 CGGCG .ACT C G C 7 C G G CG AG AT G AC CGCCGCGT AC GC C 2300 
PHAVIGHSLGEITAAYA 
40 GCCGGGATCCTGTCGC7CGACGACGCC7GCACC:7GATCACCACGCG7GC 2 3 50 

A G I L S L G D A C T L I T T R A 
GGGCGTCATGCACAGGC77GGGCCGGCCGGGG:GA7GGTCACCG7GC7GA 24 00 

R L H 7 L P ? ? G A M V T V L 

CCAGC G A.GGAGGAGGCCCGTC.YGGC GC7 GCG G G - uGo GG7G GAG A7CG CC 2 '1 50 
45 7 S E E E A R Q A L R P G V E I A 

GCGGTCT7CGGCCCGCACTCCG7CG7GCTC7CGGGCGAC r ' 2GACGCGG7 2500 

A V F G ? K S V V L S G D E D A V 
GCTCGACGTCGCACAGCGGCTCGGCATCCACCACCGTCTGCCCGCGCCGC 2 550 
LDVAQRLGIHHRLPAP 
50 ACGCGGGCCACTCCGCGCACATGGAACCCGTGGCCGCCGAGCTGCTCGCC 2 600 

K A G H 5 A H K E P V A A E L L A 
ACCACTCGCGAGCTCCGTTACGACCGGCCCCACACCGCCATCCCG AA C G A 2650 

77 RELRYDRPHTAIPND 
CCCCACCACCGCCGAGTACTGGGCCGAGCAGGTCCGCAACCCCG7GCTG7 27 00 
55 PTTAEYWASQVRNPVL 

TCCACGCCCACACCCAGCGGTACCCCGACGCCGTGTTCGTCGAGATCGGC 27 50 
FKAHTQRYPDAVFVEIG 
CCCGGCCAGGACCTCTCACCGCTGGTCGACGGCATCGCCCTGCAGAACGG 28 00 
PGQDLS PLVDGIALQNG 
60 C ACG G CGG ACG AGG T G C AC G C G C 7 G C AC A C G G C G C 7 C G C C C G C C T CT T C A 2850 

7 A D E V H A G \l 7 A L A R L F 
GP\GGCGGGGCCzGGG7GGACTGGGCCGGGA7GZ r T2GGCGG r ?GCTTGGGGG 2 900 
TRGATLDWSRILGGASR 
CACGACCCTGACGTCCCCTCGTACGCGTTCCAGCGGCGTCCCTACTGGAT 2 95 0 
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HDPDVPSYAFQRRPYWI 

CGAGTCGGCTCCCCCGGCCACGGCCGACTCGGGCCACCCCGTCCTCGGCA 30 00 

E S A P P A T A 7 S G H P 7 L G 
C ^ GG AG 7C G CC G T C G C C G G GT C GC C GG 3 3 3 3 5 3 T G 7 7 7 AC GGG T C C C G T G 3 05 0 
T G 7 A 7 A G S P G F 7 F 7 G ? V 

PAGADRA7FIAE3A7AA 
CGCCGACGCCACCGACTGCGCCACGGT33AA3AGCT3GA3 3TCACCTCCG 3150 

A D A T D C A T V E Q L 7 7 7 S 
7 G CC C G G G G G AT G G G C C G G G G G GAG G G 3 G A C G G G G C AG A G G T G G G 7C G A T j 2 0 0 
V P G G S A R G R A TACT W V D 
GAAGGGGGCGGGGAGGGGCGGGGCGGG77GAGGG7CCAGAGGGGCG7GGG 32 50 

E P A A D G R R R F T V 1 i 7 R V G 

^ A^ G ^ G^: T ^j'or.^bCT G Av_. GCC c.-.^-jou j P 3 ^. 7C' - '.j ^ ~ o oG^ ! „) „. o j 3 0 0 
GAP W T L H A l G 7 7 R P G R 
TGCCCCACCCCGAAGCCGTCGACACCGCCTGGCCCCCG7 3GGGCGCGGTG 3 3 50 
7 P Q P E A V D T A W P P ? G A 7 
CCCGCGGACGGGCTG CCCGGGGCGTGGCGACGCGCG " " ' 3AGGTCT7CGT 34 0 0 

PADGLPGA W R R A Q V F V 

CGAAGCCGAAGTCGACAGCCCTGACGGCTTCG7GGCA3ACCCCGACCTGC 34 50 

E A E V D S P 0 G F V A H P D I, 
T C G A C G C G G T C T T C 7 C C G C G G T C G G C G A 7 G G G A G C G 3 G C: A G C C G A C C G G A 3 5 00 
L 7 A V F S A V G D G S R 7 ? T G 
TGGCGCGACC7CGCGG7GCA7GCG7CGGAC333ACCG7G3TG3GCGCC7G 3 5 50 

W R D 7 A 7 H A S 3 A 7 7 7 R A C 
CCTCACCCGCCGCGAC AG 7 G G 7 G 7 G G 7 G G A. G C T C G C C G C C T T C G AC G G T G 3 600 

LTRRDSGV V E L A A F D G 
CCGGAATGCCGGTGC7CACCGCGGAGTCGGTGACGCTGGGCGAGGTCGCG 3 650 
AGMPVLTAES V T L G E V A 
TCGGGAGGCGGATCCGACGAGTGGGAGGG7CTGCTTCGGG7TGAGTGGTT 37 0 0 

SAGGS DESDGLLRLEWL 
GCCGGTGGCGGAGCCCCACTACGACGGTGCCGACGAGCTGCCCGAGGGCT 37 5 0 

PVAEAHYDGADELPEG 
AC AG G G 7 G AT C .AC C G C C AC AC ACC C C G AG GAG G C CG AC G AC G C G ACC AAC 38 00 
YTLITATHPDDPDGPTN 
C C C C A C AA C A C A C C C A C A C G C A C C G AG A C AC AAA C C A C A C G C G T C C T C A C 33 50 

P H N 7 P T R T H 7 Q 7 7 R V L T 
C G C C G 7 C C AAC ACC A C CT C AT GAG C AC C AAC C AC AC C G 7 G A 7 C G T G C AC A 3 90 0 

A L Q H K L I T T N* H 7 L 3 V K 
G C AG G .AC C G AC C CC C C AG GC G C C G G GG 7 G A 3 3 3 G G G T G A G G G G G AC C G C A 3950 
T T T D F ? G A A '7 T G 7 T R T A 
C AAAAC G AAC AC C C C G G C C G C A T C C AC C T G A 7 C G AAA C 3 C A C C A C C C C C A. 4 000 

Q N E K ? G R I H L I E 7 H H P K 
C AC C C C AC T C C C C C T C AC C C AAC T G AG GAG C G 7 C G A G G AA C G C C AC G T A C 4 05 0 

TPLPLTQLTTLHQPHL 
G C C T C AC C AAC AAC AC CC T CC AC AC C G G G C AC C T C AC G G C G AT C ACC AG C 4100 
R L T N N T L H T P H L T ? I T T 
CACCACAACACCACCACAACCP.CCCCC A^CACCCCACCCCTCAACCCCAA 415 0 

HHNTTTTTPNTPPLNPN 
CCACGCCATCCTCATCACCGGCGGCTCCGGCACCCTCGCCGGCATCCTCG 4 2 00 

HAILITGGSGTLAGIL 
CCCGCCACCTCAACCACCCCCACACCTACCTCCTCTCCCGCACACCACCA 4 2 50 
ARHLNHPHTYLLSRTPP 
CCCCCCACCACACCCGGCACCCACATCCCCTGCGACCTCACCGACCCCAC 4 30 0 

PPTTPGTHIPCDLTDPT 
CCAAATCACCCAAGCCCTCACCCACATACCACAACCCCTCACCGGCATCT 4 35 0 

QITQALT HIFQPLTGI 
TCCACACCGCCGCCACCCTCGACGACGCCACCCTCACCAACCTCACCCCC 4 4 00 
FRTAATLDDATLTMLTP 
CAACACCTCACCACCACCCTCCAACCCAAAGCCGACGCCGCCTGGCACCT 4 4 50 

QHLTTTLOPKADAAWHL 
C C AC C AC C AC AC CC AA-AAC C AAC C C C T C AC C C AC TT C G T C C T C T AC T C C A 4 50 0 

H H K T Q N Q P 7 7 K F V L Y S 
GCGCCGCCGCCACCCTCGGCAGCCCCGGCCAAGCCA^CTACGCCGCCGCC 4 550 
S A A A T L G S ? G Q A N Y A A A
AACGCCTTCCTCGACGCCCTCGCCACCCACCGCCACACCCAAGGACAACC 4 6 00 

N A F L D A L A T H R H 7 Q G Q ? 
CGCCACCACCATCGCCTGGGGCATGTGGCACACCA3CACCACACTCACCA 4 6 50 
A T T I A VJ G M W H T 7 T T L T 

s g l t d s : r dp::? r g g r g 



I S D D E G 



The AvrII-XhoI hybrid FK-506 PKS module 8 containing the AT domain of
module 12 of rapamycin is shown below.

'■j-^..-*. i i.-^ijoL * o j r-v ( o r i <o ^ u u L r t ^ o v; ^ ^ ' J j - — Lj t i Obi * *j J 

M R L V F A A R R T G 3 ? V V V 
^CGGCCG CGCCCG ACGAGGCGCCGGACGTGC GGCT G 77 1 GCGCGGGC : GGo _ J \j 

A A L D G A ? D V P T . L R 

■ j G T A C ^ AC C G T ^ C G G C G T ^ C C G C C G T C C G G ^ A~rv- o ^ T T G 1 ^vj^-^^:.^'^ ±. w 

R T T V R R A A V R £ ? S L A D 
GG i'v. oGCGTGGTGCCGGACGr\Go.-.GGoGGGGor\GG^. ^TCCCTCGCGTTC'o ^' J ... 
R 3 P C C PTTSAPTFPSRS 
TCG'i uGAACAGC.sCCGCC^CCGT GCCCGGCCACCT' ^- G j GGCCoAAj^k^G- I 2- 0 
3 V: K S T A T V G G :i 1 G A £ D I 

? A T T ? F K F L G I . Z 3 L T A 

V Q L 'r N A L T T A T G v R L N A 
ACAGCGGTCTTCGACTCTCCGaCGCCGCGCGCGCT 3 G CCGCG f\GACTCGG A 0 0 

T A V F D F P T P R A L A A R G G 
CG AC G AGCT GG C CGG T AC CC GCGCGCCCGTCGCG 3 3 3CGGACCGCGGCC A 4 5 0 

D £ L A G T R A P V A A R T A A 
CCGCGGCCGCGCACGACGAACCGCTGGCGATCGTGG3CATGGCCTGCCGT 500 
T A A A H DFPLAIVGMACR 

L P G G V A S 9 Q E L W R L V A S 
GGGGAGGGAGGCCATCACGGAGTTGGGCGCG3ACG3:GGGTGGGAGGGGG 600 

G T D A I T S F ? A D ?. G W 0 V 
ACGCGCTCTACGACCCGGACCCCGACGCGATCGGC;-.».GACCTTCGTCCGG 6 50 
3 A G Y D P D ? D A 7 G K T F V R 
C A C o u 3 G i o T C ^ * AC i uCor.'^b'jbC - Gur^j^cu^G . . C i 4. ^ ^.^ G 7 'j ^ 

H G G F L D G A T G F 3 A A r F G 
G AT G A G G CC G CG C GAGGC C CT GG CC AT G G AC C G G 3 A 7 C AA.CG G GT GCT CC 7 50 

I S ? R F A L A K 0 P 2 Q R V L 
! vJo3iuAGb7Tu^TvjGoAGkjrC^TT^u^'HA.GGoCo-j: : ^r * — ^^C^uACo^c ^ u b 
LETS V, 1 RAFFS A GIT PDA 
GCGCGGGGCAGCGACACCGGCGTGTTCATCGGCGCGTTCTCCTA.CGGGT A 6 50 

A R G S 0 T G V F I G A F S Y G Y 
CGGCACGGGTGCGGATACCAACGGCTTCGGCGCGAOAGGGTCGCAGACCA 900 

GTGADTNGFGATGSQT 
GCGTGCTCTCCGGCCGCCTCTCGTACTTCTACGGTCTGGAGGGCCCTTCG 950 
SV1SG RGSYFYGGEG PS 
GTCACGGTCGACACCGCCTGCTCGTCGTCACTGGTCGCCCTGCACCAGGC 100 0 

V T V D T A C S S S L V A L H Q A 
AGGGCAGTCCCTGCGCTCGGGCGAATGCTCGCTCGCCCTGGTCGGCGGTG 1Q50 

GQSLRSGECSLA LVGG 
TGACGGTGATGGCGTCGCCCGGGGGATTCGTGGA3TTGTCCCGGCAGCGC 1 100 
VTVMASPGGFVFFSRQR 
GGGC7CGCGCCGG^CGGGCGGGCG?JKGGCG'ZrCG2CGCGGGCGCGGACGG 1150 

GLAPDGRAKAFG A G A D G 
TACGAGCTTCGCCGAGGGCGCCGGTGCCCTGGTG3TCGAGCGGCTCTCCG 1200 

T S F A E G A G A L V V E R L S 
ACGCGGAGCGCCACGGCCACACCGTCCTCGCCCTCGTACGCGGCTCCGCG 1250 
D A E R H G H T V L A L V R G S A 
GC7AACTCCGACGGCGCG7CGAACGGTCTGTGGGGGCCGAACGGCCCGTC 1300
A N 5 0 G A S N G 7 5 A P N G P S 

Q ' E R V I H Q A 7 A N A K L T P 



G D 



P I r_ A 0 A 7 L A T V GO DRAT 



GCCCCTGC 



* 1 ^- ^ O - vj ^ --VO O ^ v. 

I G H A Q A 



10 P L L 7 G S I K S N I G H 

C G T C A G G G G T C G C C G G G A T C A T C A A G A T G G T G G A G G C C A T C C G G C A C G G G 1 55C 
A S G V A G I I K M V 0 A I R H G 
GAAGTG0OG0GGAGAGTGGA0GGGGAGGAGGCGTGGCOGOACGTCGACTG 1600 
E 1 ? F T 7 H A L; E P 5 P H V D W 
1 5 GACGGCG7GTGC237GGAGC7CG73AGG7CGGCGCGGCCGTGGCCGCGGA 1650 
T A G A V E L L 7 S A R P W P G 

T G A P ;. A G V S S F G I 6 G . 

PiSKCCtQC^Z*-* OoTCATCGTGGAAAGCG AACCCCC^.' vGl Cnu C C T >c ^'ounCr^. / ^ v 
20 N A H V I L E 5 A P P T Q ? A D N 

CGCGGTGA1 CGAGCGGGCACCGGAGTGGGTGGCGTTGGTGA1 TTCGGCC^ Ig^O 

A V I E R A ? E W V P 0 V I S A 
GG^.CGCaGTGGGCTTT GAGTGAGGAGGAGGGG'^GGTTGCvjTGCo'I AT ; wTG 1 s 5 0 
RT0SAL7EHEGRLRAYL 
25 oCGo^'j - ^ j7G jATP. A; ^ jj jC .G o ^ - 17- or\ - G ^ . o-o^u^. - 1 S'.'J 

A A S P G V D >: R A V A S 7 1 A X 

GACACGGTCGG7G7TCGAGCACCG7GCCGTGC7GCTGGGAGATGACACCG 135 0 

T R S V F E H R A V 7 D G DDT 
TCACCGGCACCGCTG7G7CTGACCC7CGGGCGG7GTTCGTCTTCCCGGGA 2 00 0 
30 V T G T A V S D ? R A V F V F ? G 

CAGGGGTCGCAGCG7GCTGGCATGGGTGAGGAACTGGCCGCCGCGTTCCC 2 05 0 

Q G SQRAGMGEELAAAFP 
CGTCTTCGCGCGGATCCA7CAGCAGGTGTGGGACCTGCTCGATGTGCCCG 2100 
V F A R I H Q Q V W D L L D V P 
ATCTGGAGGTGAACGAGACCGGTTACGCCCAGCCGGCCCTGTTCGCAATG 2150
DLEVNETGYAQFALFA M 
CAGGTGGCTC7G7TCGGGC7GCTGG.AA7CG7GGGGTG7ACGACCGGACGG 2200 
QVALFGLLESWGV R P D A 

40 V 1 G K S V G E L A A A Y V S G 

TGTGGTCGTTGGAGGATGCCTGCACTTTGGTGTC^GCu^uGoCT-_GTCTG 2 3 l 1 0 
V W S L E D A G 7 7 V S A R A R L 
ATGCAGGCTCTGCCCGCGGG7GGGG7GA7GG7CGGTGTCCCGGTCTCGGA 2 35 0 
M Q A L ? A G G V /j V A V P V S E 
45 GGATGAGGCCCGGGCCGTGC7GGG7GAGG G7GTGGAG ATCoCCGCGGTCA 2 4 0 0 
D E A R A V L G E G V E I AAV 
ACGGCCCGTCGTCGGTGGTTCTCTCCGGTGATGAGGCCGCCGTGCTGCAG 2 4 50 
NGPSSVV7SGDEAAVLQ 
GCCGCGG AGGGGCTGGG G AAGTGG ACGCGGCTGG ~ ~ACC AC"' 7ACGCGTT 2500 
50 AAEGLGKW7RLAT S H A F 

CCATTCCGCCCGTATGGAACCCATGCTGGAGGAGTTCCGGGCGGTCGCCG 2 5 50 

HSARMEPMLEEFRAVA 
AAGGCCTGACCTAGCGGACGCGGGAGG7C7CCATGGCCGTTGGTGATCAG 2 600 
E G L T Y R 7 P Q V S M A V G D Q 
55 GTGACCACCGCTGAG7AC7GGGTGCGGCAGGTCCGGGACACGGTCCGGTT 2 650 
VTTAEYWVRQVRDTVRF 
CGGCGAGCAGGTGGCG7GGTACGAGGAGGGCGTGTTCGTCGAGCTGGGTG 2 7 00 

GEQVASYEDAVFV ELG 
CCGACCGGTCACTGGCCCGCCTGGTCGACGGTGTCGCGATGCTGCACGGC 27 50 
ADRSLARLVDGVAMLHG

GACCACGAAATCCAGGCCGCGATCGGCGCCCTGGCCCACCTGTATGTCAA 2 8 00 

DHEIQAATGALAHLYVN 
CGGCGTCACGGTCGACTGGCCCGCGCTCCTGGGCGATGCTCCGGCAACAC 28 50 
G V T V D W P A 1 L G D A PAT 
G G G 7 G C 7 G G A C C 7 7 C C G A C A T A C G C G 7 7 C C A. G C A-. C C A G G G C 7 A C 7 G G C T C 2 900
R V L D L P 7 Y A F Q K Q R Y w L 
GAGG'GGGCGGCCCCGGCCACGGGGGA^CGCGGGCCACC GG G7GC7CGGCAC 2 95 0 



? ? A T A 0 S G H P V 




GC^CG ^C GG.--.7 CC GCCC v> v , vruCn^'co'- ~ G G G ^ - j I 5 0 



P G G S A R G R A T A Q 7 W V D 

AACCCGCCGCCGACGGGCGGCGGGGC7TCACGGTGGACAGCCGCG7GGFG 3200 
F P A A D G P. R R F T V H T R V G 

15 j C 1 j C ^ „ w G 1 -.J vj r. w G i - T j G- ... .-■ ^ o j o vj j j T ■_• * ^'..'jC-CCoo^'v'j'- j i j250 
C A P V: 7 G H A F G V G R ? G R V 

, ^, . ~ — ^, . — ^ ^ - . ^ - , r - ^ ^ ^ ^ — ^ — ^ ^- ~ — ^ rp — ^ O O n A 

? C F F A 7 FTAV'P P ? G A 

Gv_, o'cA'^'.jG wG7^:C GC o ; j o o G G T G C ij A C 0 C G ■ o A C . m . G - j - ; T ^. -.j - ^ 3 J 5 u 
20 P A D G G P G A W R R A G Q V F 7 

GAAGCCGAAG7CGA,CAGCCC7GACGGCT7CGTGGCACACCCCGACG7:;G7 3 4 00 

F A F V D S P D G f V A ri P D G L 

CG^vG GC G G7 C77C7CC GC GG 7 GGG G G.AGGGG A.GC G'o CG.- 1 . o ; -CG A. uCuj: -.1 J 4 o 0 




L 7 R R D 3 G 7 V E F A A F D G A 
G G G AA* 7 G C G G G T G G T C A C C G C G G A G^ T C G G T o j4 C o G T G 'o ^ G G G T G G C o T 3 600 
30 G M P V L 7 A F S V T L G E V A 

CG GC AG G C G G A 7 C CG AGG AG 7 G GG AGGG 7C 7 GC7TGGGC7 T G AG TGG77G 3650 
5 A G G S D F S D G L L R L E W L 
GGGG7GGCGGAGGCCCAG7AGGACGGTGGGGACGAGG7GGGGGAGGGC7A 37 00 
? V A F A H YDGADELFFGY 
35 C A C C C 7 C A T C A C C G C C A C A C A C C G G G A C G A C C C C G A C G A C C C C A C C AA C C 37 50 
7 7 I T A T KPDDPDDPT K 
CGCAG7VAGAGAGGGAGAGGGAGGGAGACACAA.AGCAGAGGGG7GG7CAGG 3800 
? }i M T ? 7 R T H 7 Q 7 T R V L T 
G C C G 7 C G A.A G A G C AC C T C A7 C A C G AC C AA C C A C A G G G ^ C A T C G 7 C G A C AC 38 50 
.40 A 7 Q H H L I 7 7 N K T 7 I V H 7 

CACCACCGAGCCCCCAGGCGCCGCCG7CACCGGCC7CACCCGGAGCGCAC 3 900 

7 7 D P P G A A V T G G 7 R 7 A 
AA JAA G G AA C A C C C C G G C C G C A 7 C C A C C T C A 7 C G AAA. CCCACCACCCCC A G 3950 
QNEHPGRIHGIFTHHPH 
45 ACCCCA.C7CCC GC7CACCCAG^C7CA\GGACCC7CCA.C CAGAGCCCACC7ACG 4 00 0 
7 P L ? G T Q G 7 7 L H Q ? H L R 
G G T G A G C AA G A*. A G A G C G 7 C C A G A G G C G G C A C C T C A G G C C C A T C A G G A C C C 4 050 

GTNNTLH7PHLTPITT 
AC G A C AAC AG G AC G AG AA C C A GC C G G AA.C AC C G C A G CG C T C AA C C C C AAC 4100 
50 H H G T T 7 7 7 P M T P P - G ? N 

CACGCCATCCTCATCACCGGCGGCTCCGGCACCCTCGCCGGCATCCTCGC 4150 

HAILITGGSGT7AG1LA 
CCGCCACC7CAG\CCACCCCCACACC7ACC7CC7C7CCCGCACACCACGAC 4 200 
R H L N H P H 7 Y L L S R T P P 
55 CCCCGACCACACCCGGCACCCACATCCCCTGCGACCTCACCGACCCCACC 4 25C 
PP7TPG7HIPCDLTDPT 
CAAATCACCCAJ^GCCCTCACCCACATACCACAACCCCTCACCGGCATCTT 4 300 

QITQAL7HIPQPLTGIF 
CCACACCGCGGCCA.CCCTCGACGAGGGCAGCC7CA.CCAA.ee TCACCCCCC 4 3 50 
60 K 7 A A 7 LDDATLTN L T P 

AACACCTCACCACCACCCTCCAACCCAAAGCCGACGCCGCCTGGCACCTC 4 4 00 
QKLTTTLQPKADAAWHL 
CACCACCACACCCAAAACCAACCCCTCACCCACTTCGTCCTCTACTCCAG 4 4 50 
K H H T Q N Q P L T H F V L Y S S 
CGCCGCCGCCACCCTCGGCAGCCCCGGCCAAGCCAACTACGCCGCCGCCA 4 500

A A A T L G S ? G Q A M Y A A A 
A0G7CTTCC7GGAGGGCG7GGGCA07CAGCG77A7A0GGAAGGA;:AA7GC 4 5 50 
N A FED A E A 7 H R H 7 Q G Q P 
G G G AG C .AC C A 1' C G G C 7 G G G G G A T G 7 G G G .A G A G G AG G AG C AG .AC T 7 A G C AG 4 600 
A 7 T I A W G X W H 7 7 7 7 7 7 S 

G 7 7 7 S E R D R I A < ; G G F 1 
G G .A . 7 7 7 G G A. C G A G G A G G G G A 7 G C 
7 I S D D E G M 

The AvrII-XhoI hybrid FK-506 PKS module 8 containing the AT domain of
module 13 of rapamycin is shown below.

o^_. i -7. GG o'jC 7 G7 ACoP\o ^G G^ 7 A.GG^:CoG A.^^'ovj^G"-. - C'wGG , 'oo ^ oo I ; o \z u 

M R L Y E A A R R T G S ? V V V 
■o ._'0'.: - '^G ^ 'o G7 GG7.^ ^'o-^.'o G G vjjA^ G i G G G oG i w ■ - '. ^'oo j^T^C o GJ 
A A A 7 G D A ? D V ? 7 L R G E R 

^ ^ ^, -n > t, ^ . ^ rr ^- n -~ ^ - r~ — ^ ^ — ^- ^- r» ^ ^ ^ <~ - ^ " <~p ^ ^ ^ ^ ^ - r- - r. " 
^ 'o - w 'O . o - '^J o «_ o i ^, - v o w ^ u i- ^ 'O ! o vj f ^: ' w i ^ i ^ , v_, o „ o r \ ^ w . 

R 7 T V R R A A V R E R S L A D 
j i wT ^ ^jCCG7 GCTGCCCGA.G'oACGA.GCGC^rC^-^ACGC - Tv. _ j. i *l - gj 

RSPCCPTTSAPTPPSRS 
7 G G 7 G G AA C A G C A C G G C G A7G C G 7 G G T C G G C C A C C 7 3 3 G G G G G G A A G A C A. 7 _ L ■. 1 

3 W N S T A 7 V L G H L G A E D I 
G GC3GCGACGA/0GA73G7TGAA\GGAG\C7CGGCA/rCGAG7GGC7'-~ AGGGGG^ v l 
? A TTTFKFGGI 7 S 7 7 A 

i ^ u.' j : j >~ i v_- vj ^ /- J Ln o o ^ i. i Jrtv-L.n^lJU^or.L , wL3j- J i.; - « ^; ' v ^ - ."vrt ^ o w w „ i . 

V Q L R N A E 7 7 A 7 G V R L N A 

AC AG GG G T C T T C G AC 7 7 7CC G AC G C C GC G C G C G C 7 C G G G G 7 G AG AG 7 0 G G -5 0 0 

7 A V FDFPTPRALAARLG 
CG AGG AGC TGGCCGG7 AC CCGCGGGCCCGTCGCGGCGCGGACCGGGGCC A 4 50 

OELAGTRAPVAARTAA 
CCGCGGCCGCGCACGACGAACCGCTGGCGATCGTGGGCATGGCCTGCCGT 500 
TAAAKDEPLAIVGMACR 
C7GGCGGGCGGGGTCGCG7CGGCAGAGGAGC7G7GGGGTGTCG7CGCG7C 5 50 

L 3 G G V A S p Q E L W R L V A S 
C G G 0 AC C G AC G C C A7 C A G G G A*G 7 7 G G C C G CG G ACC u ^ G vj G T o G ^^CG . G G 60 w 

G7DAITEFPADRG W D V 
AC G G G G T G 7 AC G AC C CG G AC C C C G AGGC G A 7 C GGC AAG AC C T7 CG7CGGG 65 0 
DALYDPCPDAIGKTFVR 
GAGGGCGGC7TCC7CGACGGTGCGA.GCGGC7TGGAGCCGGCG77C7TGGG 0 0 0 

HGG FLDGA7G FDAAFFG 
GA7GAGCCCGCGCGAGGCCCTGGCCATGGACCCGCAGC AACGGGTGCTCC 7 5 0 

ISPREALAMDPQQRVL 
TGGAGACGTCCTGGGAGGCG7TCGAAAGCGCGGGCA7CACCCCGGACGCG 8 00 
LETSWEAFESAGIT PDA 
GCGCGGGGCAGCGACACCGGCG7GTTCATCGGCGCGTTCTCCTACGGGTA 8 50 

ARG SDTGVFI GAFSYGY 
CGGCACGGGTGCGGATACCAACGGCTTCGGCGCGACAGGGTCGCAGACCA 90 0 

GTGADTNGF GATGSQT 
GCGTGCTCTCCGGCCGCCTCTCGTACTTCTACGGTCTGGAGGGCCCTTCG 95 0 
SVLSGRLSYFYGLEGPS 
GTCACGGTCGACACCGCCTGCTCGTCGTCACTGGTCGCCCTGCACCAGGC 10 00 

V 7 V DTACSSSLVALHQA 
AGGGCAGTCCCTGCGCTCGGGCG.A.ATGCTCGCTCGCCCTGGTCGGCGGTG 10 50 

GQSLRSGECSLALVGG 
TCACGGTGATGGCGTCGCCCGGGGGA7TCGTCGAG77C7CCGGGCAGCGC 1100 
VTVMASPGGFVEFSRQR 
GGGCCCGCGCCGGACGGGCGGGCGAAGGCGTTCGGCGCGGGCGCvjGACG^; i 1 50 

GLA P DG RAKAFGAGADG 
TACGAGCTTCGCCGAGGGCGCCGGTGCCCTGGCGGTCGAGCGGCTCTCCG 12 00 

TSFAEGAGALVVFRLS 
ACGCGGAGCGCCACGGCCACACCGTCCTCGCCCTCGTACGCGGCTCCGCG 12 50 
D A E R H G H T V L A E V R G S A
GC7AACTCCGACGGCGCGTCGAACGGTC7G7CGGCGCCGAACGGCCCCTC 1300 
A _ N 3 D G A S K G L 3 A ? N G ? S 

1\ w v.' '.j -w o - ^ r-. - v_-^v^ '^.'-'i'.j'o ^. ^ . w J - o o -^j -. „ 1 w ^ ~ ^ O ^ -j ^ \j 

Q E R 7 I K Q A L A N A E A T P 
G G.~7 G7 GG A.CGG GG7 GG A.GGCG CA. OG ~A ^ 7 o G 7.-^0 GC GG G - ^.;GG^AG 1 -1 00 
A G 0 A V E A H G 7 G 7 R G G D 

? L G L G S L K S N I G 7 A Q A 

r— rp ~ — . ^ ^ m ,-, r ^ t - ^ r - - <-i ^ - — ^ ^ t- i - ■ — ,-. ^, t, — — ; C " P> 

I -r.ou'ju . ^o^'^u'J o M I _ j m * 'o>j - '~ .*--■_: o _ — - - — ovjj'o — - ^' 

A S G V A G I I K M 7 0 A _ R H G 
^nJ-vCT oC^uC v.G.A^ 1 —.GGGC/-iGGGGvGAGGAGG , ^--j j. Oo^-v j ^-.-^..j ; ^ jA'^ . i G 
15 E L P P T R H A D E ? 3 ? P V D V; 

vj ^ w ^j^j i a ^} x ^o. M .o^- : ^ v_ ^ vj.*v_ * ^-j'-j: ^ ^ o ^ o i -jo vju jo.". ^ w - J 

T A G A V E G L T S A R ? W E G 

^ ^ o \j * ' v G w I .-^G '— " v - j o G o i^j w o G J . G G ^ ^ - ^ ^ G '-J .".o » - / J >w >_ . . ^ 1 J J 

7 G R P R K A G 7 S 3 E G S G T 

— rv.". „'o — ^ v-.*". - o - - — ^ oU.Hun j .*"i. l wv- '^'o ^ i w'-.o ~, — 'o ^ u 1 J 'J 1 j .* *. _ ■ \j 

N A H V I L E S A P P A Q ? A E E 

JULJL.^clui'J. i O f\ O ^ O O ^ 'o J? i o ^ ^ i o j; ."!. - vj ■. ; v . . ; J^^O^ . Uvj J. kJ 

AQPVETPV V A S D V A P L 
7GA7ATCGGGGAAGAGCCAGCGGGGGG7GAGGGAACAGG;GvGAGCGGG7G ie50 
25 V7SAKTQPAL7EH EGRL 

CGCGGCTAGw /GGGGGCGG'CGCCGG'wG'oGGGT-. GAG AGGG TG7GGG.AI C j. 9'J0 

R A Y G A A S ? G A E) I R A V A S 
GAGGGGGGGGGGGA.GA.CGGTCGGGGGTGGAGG^GGGG G GG"! .-iCTGC7 1 7G 1 0 
T L A V T R S V E E ri R A 7 G G 
30 GAG A7GA.GAGGG7CACCGGCAGGGGGGTGA0GGAGG G GAGGA.7GG7G77T G 0 0 0 
G D D T V 7 G T A V T D P R I V E 
GTGTT7CGCGGGGAGGGGTGGGAGTGGGTGGGGATGGGGAG7GGACTGGG 2 05 0 

V E P G Q G W Q W G G M G S A h R 
CGATTCGTCGGTGGTGTTCGCCGAGCGGATGGCCGAGTGTGCGGCGGCGT 2100 
DSSVVFAERMAECAAA

7GGGGGAGTTGGTGGACTGGGATG7G77GAGGG7TCTGGA7GA7GGGGCG 2150 
G REE V T D W D1ETVLDDP A 
G7GGTGGAGGGGGT7GA7GTGG7GG.GGCC0GG7TGG7GGGG G^TGATGGT 2 2 0 0 
VVDRVDVVQPAS W A M M V 



40 



SLA A V W Q A A G V R P G A V 



I G H S Q G E I A A A C V A G A V 
TCACT ACGCGATGCCGCCCGGATCGTGACCTTGCGCAGCCAGGCGATCGC 2 3 5 0 
45 5 LR DAARIVTLRSQAIA 

CCC-^ti GGG7 GGCGGGGCGGGGCGGGATGGG A'i'GGG TCG GGCT'GCCCGGGG z^00 

R G L A G R G A M A S V A G P A 
AGGA7GTCGAGGTGGTCGACGGGGCGTGGA7GGGGGGGCAGAAGGGGCCG 24 50 
QDVELVDGA W I A A H N G ? 
GCCTCCACCGTGATCGCGGGCACCCCGGAAGCGGTCGACCATGTCCTCAC 2500
ASTVIAGTPEAVDHVLT 
CGC7 1 CA7GAGGCACA„AGGGGTGCGGGTGGGGCGGA.7CAGGG PGGACTATG 2 5 d 0 

AHEAQGVRVRRITVDY 
CCTCGCACACCCCGCACGTCGAGCTGATCCGCGACGA.ACTACTCGACATC 2 600 
ASHTPHVELIRDELLDI

ACTAGCGACAGCAGCTCGCAGACCCCGCTCGTGCCGTGGCTGTCGACCGT 2 650 

T S D S S S Q 7 ? I V P W L S T V 
GG AC GGC ACCTGGGTCG AC AGCCGGCTGGACGGGGAGTA.CTGGTACCGG A. 27 00 
D G T W V D S PLDGEYWYR 
ACCTGCGTGAACCGGTCGGTTTCCACCCCGCCGTCAGCCAGTTGCAGGCC 2750
NLREPVG FHPAVSQLQA 
CAGGGCG.ACACCGTGTTCGTCGAGGTCAGCGCCAGCCCGGTGTTGTTGCA 2800 

QGDTVFV EVSASPVLLQ 
GGCGATGGACGACGATGTCGTCACGGTTGCCACGCTGCGTCGTGACGACG 2850 
A H G D D 7 V T V A T G R R D D

g g> _ a t r m l t a l a ; a y 7 h g 
gggaggg^fgagtggggggggatoggggggaggaoo^aaggggggtagt 2 95 0 
7 t 7 g v; p a i g g t 7 g 'i 1 r v g 

G G p G V A F G H v ; ?. V W L E S 
A P p A T A C S G H ? 7 G G 7 G V 
A V A G S ? G R 7 



3100 



o v_ o ^ *o » i ^ vj: j. j u 

A D R A 7 F I A E G A G A A A D 
CGAGGGAGTGGGGGAGGG7GGAACA30TGGAGG73A30TCGG7GG0GGGG 3 2 00 
15 A T G C A T V E Q G 0 7 G S V R G 

G oA^TOGG GG G GGG wG A.u GGGG A.GGG G GG^GA.GG .Gjj. Gor-.T'oA^CGGGC 32; 7 
3 7 A R G R A n ' A Q ~~ W 7 D F P A 

A 3 G . . R R FT V K G R 7 G DA 
20 GGTGGA.GGGTGGA.GGGGGAGGGGGTGGGGCGGGGGG3 3GGGGTGGCCCAG 7 3 50 
P W T L H A F G V L R ? G R V ? 0 
GCCG^AGC w G TG GA.CACC GGGTGG G OGCG G GC G o'~ _ «~ ■_ ooT oG ^CoCour 1 , ji v 
? F A 7 D T A W ? P ? G A V P A D 

Lo'j^;, 1 ^ ^U'j'j^-o >. 'J ^ r. v »j ^ u „ o 1 j n ^: j - ^- - ± ■— — * a ^u.^j^wJ ^ ^ J 

25 G L ? G A W R R A 3 Q 7 F V FA 

.-lAG TG ^ AG G G G " GA.CGGGTTGGT ~ ^ .~AGAG - v^. j.t^^ - v^G ^ConCoCu Go CO 
E 7 C S P D G F 7 A H ? 3 G L D A 
G a G ^ T G i OGuGGGTG. viGCG.-.OoGG o ... G o '„ G A .j _ - ~ ~ ~ '■ j ; j A T G G o C G A 3 7 C 
V F S A 7 G D G 5 R G 3 7 G W R D 
30 G0TGGGGGTGGAGGCGTGGGA,GGGGA3GGTG0TGGGGGGGTGCCTCAGCG 3 6 70 
L A 7 HASDATVLRACL T 
GCCGCGACAGTGGTG7CGTGGAGCTCGCCGCC7TCGACGGTGCCGGAATG 3 65 0 
RRDSGVVELAAFDGAGM 
CCGGTGCTCACCGCGGAGTCGGTGACGCTGGGCGA.GGTCGCGTCGGCA.GG 3 7 00 
PVLTAESVTLGEVASAG

GGG AG 3GG7aGG AGTGGG A.GGGTGTGOTTGo^ G . _ G AGToldTTGCGGGTGG 3 7 o 0 

GGDESDGL3RLFWLPV 
CGGAGGGGGAGTACGACGGTGGCGAGGAGCTGGGGGAGGGGTAGACCCTC 38 0C 
A E A H Y D G A 3 E L ? F G Y T L 
40 A T C A 0 C G G G A G A G A G C C G G A C G A G C G G G A C G A 0 G 3 3 .A G C AA CCCCCAC AA 3S5 0 
I T A T H ? D D P D 3 P G N P H N 
C A C A C C C A G A C G G A G C C A G A G A C AAA CCACACGCGTCCTCACCGCCCTCC 3 9 00 

7PTRTH7QTTR7LTAL 
AACACCACCTCATCACCACCAACCACACCC7CATCG7CCACACCACCACC 3 9 50 
45 Q H H L I T T N H T L I 7 H T T T 

GAGCGGGGAGGGGCGGGGGTCACCGG70TGAGGGG0AGGGGACAAAACGA 4 000 

D ? ? G A A V T G L 7 R 7 A Q N E 
ACACCCCGGCCGCATCCACCTCATCGAAACCCACCAGGCCCACACCCCAC 4 0 50 
H PGRIHLIET H H P H T P 
TCCCCCTCACCCAACTCACCACCCTCCACCAACCCCACCTACGCCTCACC 4100
G P L T Q L T T L H Q F H L R L T 
AACAACACCCTCCACACCCCCCACCTCACCCCCA7CACCACCCACCACAA 4 150 

N N T LHTPHLTPITT H H N 
C A C G A G G A C AA C C A C C C C C AA G A G G G 0 A G C C C T G A_A G G G G -AA CCACGCCA 4 2 00 
55 T 7 T T T ? N T P P u K P N H A 

TCCTCATCACCGGCGGCTCCGGCACCCTCGCCGGCATCCTCGCCCGCCAC 4 2 50 
ILITGGSGTLAG3LARH 
CTCAACCACCCCCACACCTACCTCCTCTCCCGCACACCACCACCCCCCAC 4 300 
LNHPKTYLL3RTP? PPT 
60 CACACCCGGCACCCACATCCCCTGCGACCTCACCGAGCCCACCCAAATCA 4 3 50 
TPGTHI PCGGTDPTQI 
CCCAAGCCCTCACCCACATACCACAACCCCTCACCGGGATCTTCCACACC 4 4 00 
TQALTHIPQPLTGI FHT 
GCCGCCACCCTCGACGACGCCACCCTCACCAAC 
A j, T L D D A T L T N L T P Q H L

caccaccaccctccaacccaaagccgacgcgggctggcacctccaccacc -;seo 

AG AC GCAGGGACCAACCCCTCACCCAOTCCGT GCTGCACCCCAGCGCCGCC -3 550 

5 :-: t c q r- l g :•: f v v s s a a 

a t g g s f g c a :) ':' a a a v, a f 

gcagaghrgtqc^pag 
10 gcatcgcctggggcagccggcacaccaccaccagagccaccagccaagtc -; g og 
g : a w g m w h t g 7' t g t 3 q l 
accgacagcgagcgcgaccggatccgccgcggcggccccctgccgatctc mbz 
tdsdrdrirrg g f c p i s 

^^.^■^ G -o A G'j oGATGG 
15 D D F G M 

The NheI-XhoI hybrid FK-506 PKS module 8 containing the AT domain of
module 12 of rapamycin is shown below.

GC ATGGG o i ^G oTA^CG AGGCG G G ACXjG C GCACCGoA-h^jT ~CCGTGGiGG-G 5 0 
20 M R L Y F A A A ?, T G S P V V V 

■o — o 'o : . . lG^c ^. ? ... o C G A C 'o v_. . j <~ 'o G r*. G G i uLCo ^ 1 ^ ^ . 'j-w'j^uo^C * G ^- 'o i ^ j 

A A A G D D A ? 0 7 ? G G R G L P, 
G ; v. GT AC o/aGGGGGCGG GG Go GCGCCGTCCGooAA.^k:) ^GGI ^CC^^Gu^C^ * ~ 0 
R T T V R R A A V R E R 3 L A D 
25 G G TCGCCG T G C I G CC C G AC G A.C G A G G G G G CCG ACG C G T C CC T C G GG T T Co GO 0 
R S ? C C P 7 T 3 A ? G P P S R S 
T C C T G G AG 1 . G A v 5 G A C G G G G A G G G " 1 G C T C G G G G A C C T G G G G G C C G r-G-.G AC A. P G s 0 

S W N 5 T A G 7 G G H G G A £ D I 
v-^^GGG'o-^vjr-.GG^CGTG^ '■.-•-u'j/vACT^ ^GG.nT „ - lGCT ^A-CCG'^^j^j G j0 
30 PATTTFKEFG103LTA 

TCCAGCTGCGCAACGCGCGGACCACGGCGACCGGCGCACGCCTCAACGCC 3 50 
V Q L P. K A L T T A T G V R L N A 
ACAGCGGTCTTCGACTTTCCGACGCCGCGCGCGC7CGCCGCGAGACTCGG A CO 
T A V F D F P T P R A L A A R L G 
35 CGACGAGCTGGCCGGTACCCGCGCGCCCGTCGCGGCCCGGACCGCGGCCA A 50 
0 F G A G T R A ? V A A R T A A 

w j 1 ~ o _ o w.-Vv^i.H^cr^^^ ~3 ^ - o j : > ijuuv,.- 1 . ± o o ^. x o^L-*^ ~ ^ ' J w 

T A A A K D E F L A I V G M A C R 

C G G G G GCG C G G G G GC A G A G G A G C I* G 7 G G C G T C T Co T CG CG 7 C 5 ~' 0 
40 L ? G G V A S ? Q E L W R L V A S 

G GGC AC GG AG G CC AT C AC GG AG TT CC CC G C GG ACCGCGG CT GG G ACGTGG 6C0 

GTDAITFFFADRGWDV 
ACGCGCTC7ACGACCCGGACCCCGACGCGATCGGCAAGACCT7CGTCCGG £50 
D A L YD PDPDAIGK7 F V R 
45 GACGGGGGGTTGCTCGACGGTGCGACGGGCTTCGACGGGGCGTTCTTCGG 7 00 
H G G F L D G A T G F G A A F F G 
GATCAGCCCGCGCGAGGCCC7GGCCATGGACCCGCAGCAACGGGTGCTCC 7 50 

ISFREALAMDFQQPVL 
7 G G A G A C G 7 C C 7 G G G A G G C G 7 T C G AAA GCGCGGGCATCACCCCGGACGCG 3 00 
LETSWEAFESAGITPDA

GCGCGGGGCAGCGACACCGGCGTGTTCATCGGCGCGTTCTCCTACGGGTA 6 50 

ARGSDT GVFIGAFSYGY 
CGGCACGGGTGCGGATACCA^CGGCTTCGGCGCGACAGGGTCGCAGACCA 900 
GTGADTNGFGATGSQT 
GCGTGCTCTCCGGCCGCCTGTCGTACTTCTACGGTCTGGAGGGCCCTTCG 950
SVLSGRLSY FYGLEGPS 
GTCACGGTCGACACCGCCTGCTCGTCGTCACTGGTCGCCCTGCACCAGGC 10 00 

V T V D T A C S S 5 L V A h K Q A 
AGGGCAGTCCCTGCGCTCGGGCGAATGCTCGCTCGCCCTGGTCGGCGGTG 10 50 
60 GQSI. RSGECSLALVGG 

TCACGGTGATGGCGTCGCCCGGCGGATTCGTCGAGTTCTCCCGGCAGCGC 1100 
VTVMASPGGFVEFS RQR 
oG<j'.' T GG GG C GG GaCGG GCG GGG GA~ ; vo o - ~ ^ ~ o G G C GCGGACGG 1150

G I , A P r; G R A K A F G A G A G G 

T 5 Y A E G A G A L 7 V F R G S 

G A G R H G H G 7 L A G 7 A G S A 

A W 3 G G A S N G L G A F K G ? S 

10 G G R V 1 H Q A L A G A K G T P 

A i: V G A V E A H G G G G R G G D 
CCCAT GG A GGCG G AGG GG G T GCTC GCGAA" 37 A - G G A GA.G GA*C GGGGCGAG 1 4 o 0 
F I G A Q A I, I. A T V G Q G RAT 

RGGLGSGKSGIGHAQA 



15 



G G V A G I' K M 7 G A I R H G 

20 G G ? ? T L H A G L F S P H V D W 

i o ."iCo G ^ o o o 5 o C G 'jTC'ijAGl T o * G ."*C kj i w o j ^C^oGw o ^ i ^ j ^CG G oG ~> i c 50 
T A G A V E L L 7 3 A R F W ? G 

T G R ? R R A A V G G F G 7 S G G 
— 5 .".A G G G G G A G r\ T G v T C C T T G A G G C .--i o G \ G G G -G., rv.- G . r o G G — o G T G G j. ' 
N A K I I L F A G F 7 K T G ? V E 

G 'o'^A.o o ~. jCu. m . . G oAoiGCA.GGA.Cl. o o 'I CG.-v.-vo - .-.'ju; ^ ^ o - o o G AG^C - o _ t o 0 

A G A I E A G P V F 7 G ? V F A 

G A.CCGCT GCCCGCGGCGCCG CCG TC.AGCG^ 2 GG GoOGrGAo-.-CCTTCCGCTo ^ i.; 5 0 
30 G P L ? A A P ? S A ? G F G G ? L 

GG GGG G T C GGG G G G T ? CC C G GG AG G C AC 7 3 G A 0 GAG C AG A? GGGGCGCGT 1900 

L V S A RSPEALCEQIGRL 

GCGCGCCTATCTCGACACCGGCCCGGGCG7CGACCGGGCGGCCGTGGCGC 19 50 
R A V L D T G P G V G R A A V A 

35 AG AC AC? G G CC CG GCG 7 AC G C AC? 7C ACCCA.CC GGG CCG? AC TGCTCGGG 200 C 
Q T L A R R T H F T H R A V LEG 

GA.CACCG7 C.A'I GG CtC,CjC.T QQQGC.C "3 GoGA.CCAGGCCG.~..CGAJACTCG?C?T 2 0 o 0 
D T V 1 GAP PALQADELVF 

r>pfnr>A-^ r"^.^'-"-""fr -> a • r , ^- — - ~i — -> ~ m " — ^ tv ^. rp ^. ^ i ^, ^ 

^.u i v. „ * v.. ... o'J - ^ .'-io ooo..* i'^. w 'w,r.o , _ ... .J v.'.j;\ ^ o j J o 0.-^ j .^Mojo i no ^ 1 „ w 

40 7 V 5 G Q G T Q H ? .-. M G F Q L 

C G C o, G G o T 'i G_- G C C G 1' C r T G G G o ^.v: j ; ± ^ - w r.i _ .-^ o o *> o o ; . ^ T jobnC^-T j ^ ^ 5 C 
A A A F ? V F A R I H Q Q V W D L 
CTCGATGTGGGGGATGTGGAGGTGAAGGAGAGGGGTTAGGGGCAGGCGGC 2 2 00 
LDVPDLEVNETGYAQPA 
CCTGTTCGCAATGCAGGTGGCTCTGTTCGGGCTGCTGGAATCGTGGGGTG 2250
G F A M Q V A L F G I, h E S W G 
T ACG AGC G G AC GC GG? G ATC GG C G AT G G G GT G GGTG AG GTTGCGGCTGGG 2 300 

V R ? D A V I G H 3 7 G F L A A A 

? ATGTG7CCGGG ~TG?GG?CGT?G GAGG.ATGG 2TGG AC GG ? GGTGTCGGC 2 350 
50 V V S G V W S h F G A G T G V S A 

GCGGGCTCGTC ? GATGG A GGCTC? GC C G GG G GG7GGGGTG AT GG?CGCTG 2400 

R A R L M Q A L ? A G G V M V A 
TCCCGGTCTGGGAGGATGAGGCCCGGGCGGTGGTGGGTGAGGGTGTGGAG 2 4 50 

V ? V 5 E D E A R A V L G E G V E 

ATCGCCGCGGTCAACGGCCCGTCGTCGGTGGTTCTCTCCGGTGATGAGGC 2500
I A A V N G ? S S V 7 L S G D E A 
CGCCGTGCTGCAGGCCGCGGAGGGGCTGGG G.A-.G7GGACGCGGGTGGGGA z d 5 0 

A V L Q A A E G L G K W T R L A 
CCAGCCACGCGTTCCATTCCGCCCGTATGGA.ACCCATGCTGGAGGAGTTC 2 600 

60 t s :-: a f k s a r e-: e ? m l e e f 

CGGGCGGTCGCCGAAGGCCTGACCTACCGGACGCCGCAGGTCTCCATGGC 2 650 

RAVAEGLTYRTPQV SMA 
CGTTGGTGATCAGGTGACCACCGC7GAGTAGTGGGTGGGGCAGGTCCGGG 27 00 
V G D Q V T T A E Y W V R Q V R 
AC AC G G 7 C 3 G G 7 7 C G G C G AG C A 3 G 7G G 3 C 7 C G 7 A 3 G AG G AC 3 3 C G 7G 7 7 C 2 7 50
3 T V R F G E Q V A 3 Y 2 D A V F 

V F L G A D R S 3 A R 3 7 D G V A 

5 GA7GC7GCA.C 3GCGACCA.CG AAJY7CCAG 33 2 333 A7C33 333 3G7GGC 3 C 2 9 2 0 
M L H G C H F I Q A A I 3 A 3 A 

A3G7GTAT3T3AAG333G73AGG37C3A3T3G3333:G37C3 , :3GGC3A7 2°0 0 
H 3 V V M G V 7 V 3 W P A 2 2 G D 

G ^7CGoGCA.*v3A3GGG7GG7G3ACC7T33GACA - rVGGC777 CCAGGAC 3A c';jG 
10 A ? A T R V 2 D L ? T V A F Q H Q 

1 j 'o w i ."i i G 3 ^ ^ G AG 2 C '-_3 >a T 3 C *w ^ 3 o ^ 3 r\ ^ 3 G 3 ^ ^ _ -o o ^ 3 3 ,-\Q~ , 3 J Y u 

R Y W L E 5 A ? P A T A D S G K 

3 w G T 3 1 o o 3 ^. ^ G \j AG T C o 3 Y. 1 C^'^Cooo i j - ^. v v;u : o - : - . - ^ j j ^ 0 
P V 3 G 7 G V A V A G S F 3 R V F 

15 /Vv J GGG7CCCG7GCCCGCCGGTGCGGACCGGGCGG7G77C,-.7CGCCbAAC ) Yl CO 
T G P V P A G A D R A V F I A E L 

G-^CGCTCGCCGCCGCCGACGCGACCGACTGCGCCAXGGTC'..: .----.GAG^ 1 33 j j. " 0 

A L A A A D A T D 3 A 3 " 3 Q L 

/^3vjT^.A332 1 33oTG3CCGG3 : vcjGATCCG333GCGGC. . ^ o'^'^ o'^GC^c Y Y 
20 3 V TSVPGGSARG R A 7 A Q 

AC3T3GGTCGA7GAAGC3GCCGCCGACGGGCGGCGC33C7T3A3CGTCCA 22 2 3 
7 W V 3 3 ? A A D G R R R F T V H 

7 R V G D A P W 7 2 H A F G V L 
25 GCCCCGGCCGCGTGCCCCAGCCCGAAGCCG7CGACACCGCC7GGCCCCCG 2 3 5 0 
R P G R V ? Q ? E A V C 7 A W ? P 
CC'vj:GG3 , o3G37'oCCCGCGG.n3GoGC7G wC ^GGGG 1 -^; * v^^w jh-^^u^GGr. .j 4 Y 0 

P G A V P A D G L ? G A W R R A D 
C3AGG-7377CG7CGAAGCCGAAG7CGACAGCCC7GACGGG723G7GGCAC 34 50 
30 Q V F V F A F V D S ? D G F V A 

AC 3 CCvj A332 3 3 7CG ACGGGG 7CT7C7CCGCGo 7l-vjGC'j.~vC AGCCGC Y 2 Y 0 
H P D 3 3 D A V F 3 A V G 3 G S R 
CAGCCGACCGGA7GGCGCGACC7CGCGG7GCACGCG7CGGA3GCCACCG7 35 5C 
QPTGWRDLAVHASDATV 
35 GCTGCGCGCC7GCCTCACCCGCCGCGACAGTGG7GTCG7GGAGCTCGCCG 3 600 
LRACL7RRDSGVVELA 
CCTTCGACGGTGCCGGAATGCCGGTGCTCACCGCGGAG7CGGTGACGCTG 3 650 
AFDGAGMPVL7AESV7L 
GGCGAGG7CGCG7CGGCAGGCGGA7CCGAGGAG7CGGACGGTG7GC77CG 32 0 0 
40 G K V A S A G G S 3 F S D G 3 L R 

G377GA/37GG77GCCGG7GGCGGAGGCCCAC7ACGAGGG7GGGGACGr\GG j < ~C 

3 F W 3 P V A E A H Y D G A D E 
7 G C C C G A G G G 3 7 A 3 A* C 3 C 7 C A 7 C A C C G C 3 A C A C A 3 3 C C G A 3 G A C C C C *o A C Y 3 Y 0 
3PEGY7L:7A7HPDDPD 
45 GACCCCACCAACCCCCACAACACACCCACACGCACCCACACACAAACCAC 3 8 50 
DPTNPHNTPTRTHTQTT 
ACGCGTCCTCACCGCCC7CCAACACCACCTCATCACCACCAACCACACCC 3 9 00 

RVLTALQHHLITTNHT 
TCATCGTCCACACCACCACCGACCCCCCAGGCGCCGCCG7CACCGGCCTC 3 9 50 
50 3 2 V H 7 7 7 0 P ? G A A V 7 G L 

ACCCGGACGGGACAAJGV2GAACACCCCGGCCGCA7CCACC73A7CGAAAC 4 0 00 

7R7AQNFHPGRI H 3 I E T 
CCACCACCCCCACACCCCACTCCCCCTCP.CCCAACTCACCACCCTCCACC 4 050 
HKP4TPLPLTQLTTLH 
AACCCCACCTACGCCTCACCAACAACACCCTCCACACCCCCCACCTCACC 4100
QPHLRLTNNTLKTPHLT 
C C C A T C AC C A C C C A C C AC AAC AC C AC C AC AAC C AC C C C C AA C A C C C C AC C 4150 

?IT7tiKNTTTTT P N 7 P P 
CC7CAACCCGAACCACGCCA7CC7CA7CACCGGCGGCTCCGGCACCC7CG 4 200 
60 L N F N H A I L I 7 G G S G T L 

CCGGCA7CC73GCCCGCC3 CCTCAACCA.CCCCCACACCTACC7CCTCTCC 4 2 50 
A G I LARHLNHPH7VLLS 
CGCACACCACCACCCCCCACCACACCCGGCACCCACATCCCCTGCGACCT 4 300 
R7PPPP77PG7HIPCDL 
XCCACCCAAATCACCCAAGCCCTCACCGACATACCACAAOCCC 4 350
P 7 Q I 7 7 A L T H I ? n p 

^ : F H T A A 7 7 D 7; A 7 7 T 



A W H L H H H T Q N Q P L 7 H IT 
TCCTCTAC7GCAGCGCCG0CGCCAC0C7CGGCAG0GC0GGCCAAGC0AAC 
V L Y S S A A A 7 L G S ? G C A N 

T Zi i'* r" ^ " ^ ^<-/-""m-p«/-.rp'-" ( -| ^ .-^ ^ r-. — — — ^_ - ^ ^ ,-■ - — " "< ; ^ £ ^ ^ ^ 

' " ^ " " 2* r, ^ i 1-1 ^ ^ r: ^ ^ ^ r. m ^ ^ ^ " ^ ^ ~ ^ r. ~ ' ~ ~~ ^ j . z ^ ^ p+ c l 
Q G Q P A 7 T I A V: G X W H T 7 



1 5 GGAGAG7GA:GAGGGAAG7GAGGGACAGGGAC 

TTL7SGLTDSDR7p:RR 
jG'- G'j- . i ^_ 7 C ^ A* r 7 G o rv^ 'o A w G A G \j G ^ A ~ o ^ 
G G F L ? I S u D E G X 

The NheI-XhoI hybrid FK-506 PKS module 8 containing the AT domain of
module 13 of rapamycin is shown below.

MRLYEAARRTGS? T v' V V 

25 A A A L D D A ? D V P L L R G L R 

G GG 7 AG G AC C G 7 CC G GC G7 GC C G C C G 7 C C GG G AAC G G 7 0 7 C 7 C GC GG AG G 1 50 

RTTVRRAA V R E R S LAD 
GC7CGGCG7GC7GCCCGACGACGAGCGCG0CGA0GCG7CCCTCGCGTTCG 2 0 C 
R S ?CC?TT5A?TPPSRS 
30 TCCTGGAACAGCACCGCCACCG7GCTCGGCCACC7GGGCGCCGAAGACA7 2 50 
3 W N S T A T V L G H L G A E D I 
C GC G G G GACG AC GACG 7 T C AAG G AA C T CG GC AT C G AC 7 C G C 7 C A 0 C G C G G 50 0 

PATTTFKELGIDS L 7 A 
TCC AGCTGCGCAACGCGC7GACCACGGCGACCGGCGTACGCCTCAACGCC 3 5 0 
35 VQLRNALTT A 7 G V R L N A 

AC AG C G G T GT 7C G AC 7T 7C CGACG CCG CG GG CGC T GGC G G CG AGAG TCGG 4 00 

T A V F D F ? 7 P R A L A A R L G 
C GACG AG CTGGCCGG TAG C CG CGC G C CCG TCGCG G G G GG G AO' CG G G GCC A 4 0 0 
D F 7 A G 7 R A ? V A A R 7 A A 
40 ^OGCGG'wCGGGCACG AC G AAC C G C T G G C G AT G G 7 ^ o ^; ^ o o v. ^ 7 . j C G o T 0 L 
7 A A A H D E P L A I V G M A C R 
CTGGCGGGCGGGGTCGCG7CGCCACAGGAGC7G7GGCG7C7CG7GGGG7C 5 0 

L P G G V A S P Q E L W R L V A S 
CGGCACCGACGCCA.TCACGGAGTTCCCCGCGGACCGGGGCTGGGr " "7GG 60 ^ 
45 GTDAITEFPADRGWDV 

ACGCGCTCTACGACCCGGACCCCGACGCGATCGGCAAGACCTTCGTCCGG 6 50 
DALYDPD PDAIGK7 F V R 
CACGGGGGCT7CCTCGACGGTGCGACCGGCTTGGACGCGGCG7TG77CGG 7 00 
H G G F L D G A T G F C A A F F G 
50 GATCA G G C C G C G C G AG GCCCTGGCCA. TGG AG CCG 0 Au <~ AAC o G ^ 7 G ^ 7 C C / -> 'J 
ISPREALAK DPQQRVL 
T G GAG AC G T C C T G G G AGG C G 7 7 C G AAAG C GC G G G G A7 G AC C C C G G ACG C G 8 0 0 
LETSWEAFESAGIT PDA 
GCGCGGGGCAGCGACACCGGCGTG77CA7CGGCGCG7TC7CC7ACGGGTA 8 50 
ARGSDTGVFIGAFSYGY

CGGCACGGG7GCGGATACCAACGGC77CGGCGCGAGAGGGTCGCAGACCA 900 

GTGADTNGFGATGSQT 
GCG7GCTC7CGGGCCGCCTC7CGTACT7CTACGG7CTGGAGGGCCC77CG 950 
SVLSGRLSYFYGLEGPS 
60 GTCACGGTCGACACCGCC7GCTCG7CGTCACTGGTCGCCCTGCACCAGGC 10 00 
VTVDTACSSSLVALKQA 
AGGGCAGTCCCTGCGCTCGGGCGAATGCTCGCTCGCCCTGGTCGGCGGTG 1050 
GQSLR5GECSLAL7GG
TGACGGTGATGGCG7CGCCCGGCGGATTCG7CGAGTTGTCCCGGCAGCGC 1100 

v . 7 v A s ? g G F 7 e it s r q R 

'vj o \j: i „ 'o G: > ^ - c G n ^ v 'o o v u o - J'- v; .~ 'o o ^ i 'o ^ o L '..j o — - ^^'o'jn^ov! x 1 5 0 

t> G L A r G G R A ?' A F G A G A G G 

t o .'"i. ^; * * ^ J ^ L. g rA j .; o ^ U v U J - J - ^ i J J i J .; . '.,'.1."..; ^ j J ^ T^l ^'^ j _ 2 0 0 

T G F A E G A G A G 7 7 F R L S 
i-v-. ■o~j.- i .G ! ^ o G C ACGGCCA.C.-.G GG - G d CG ^. G w T JG AA 1 ^ j-j^Tw^G^ «.j _ 2 o 0 
D A F R K G H T V L A L V ?. G S A 
10 GCTAAC7CCGACGGCGCGTCGAACGGTCTGTCGGCGCCGAACGGCCCCTG 1300 
A N S G G A S N G 7 S A P K G F S 

ccaggaacgcgtcatccaccaggccctggcgaacgcgaa-~ctcacccccg 135G 

Q F R 7 I K Q A L A N A K G ? ? 
v-^ATG I GGr-vGoGGGTGG--nGGG^-j GrvlGGC^C G^^C/-.A^ _ ^ — _ - CoGCG.-vC 14 0 0 
15 A D 7 D A V F A H G 7 G G F G G G 

G^GATG ^ AGov^oCAGGCGCTGCT CGrGu RAv_, '.jor.^.-.'jj.". ~ ^oGGGo-vL. 1 -i 50 
? I E A Q A L L A T Y G Q G R A T 

'J ! .Ij^C 1 o'^ 1 u'^ t CGuC 1 ^ ^ o rv." 1 . ^ * ^ . . - . .-".7 j C"'0 M ^ ^ ^ 'J ~3 ^ ^ o 1 1) 0<J 

P L T L G S L K S N I G H A Q A 
20 CGTCAGGGGICGCCGGGATCATCAAGATGGTGCAGGCCATCCGGCACGGG 1 550 
A S G V A G I I K M V Q A I R H G 
G.nACGGGGGCGGACACGGGAGGGGGAGGAGGGGTG'oGGoG/- G jT^CAl?^ _ cOQ 

E l F P T L K A D E ' ? 3 P H V D W 
■-A.CGGC CGGTGCCGTCGA*GCTCCTGACGTCGGCCCGGCCwT GoGGGGboA _ c D 0 
25 T A G A VELLTSARP W P G 

CCGGTCGCCCGCGCCGCGCTGCCGTCTCGTCGT7CGGCGTGA,GCGGC^CG 1/00 
T G R P R R A A 7 S S F G S G T 

AAC G G G G AG AT CAT CC T 7 G AGG G AG G AG C G G T C AAAAC G G G AG C G G T G G A 17 50 
NAH II LEAGPVKTGPVE 
30 G G C A G G AG CG AT C G AG G C AG G AG GG G T G G AAG T AG GAG GGG7CGAGGG7G 18 00 
A G A I E A G P 7 E V G ? V E A 
G ACCGCTCCCCGCGGCGGCGCCG7GAGGACGGGGCG AAGAGG7TCCGCTG i 8 5 0 
GPLPAAPPSAPGEFLPL 
CTCGTGTCGGCGCGTTCCCCGGAGGCACTCGACGAGCAGATCGGGCGCCT 1900 
35 L V S A R S P E A G D E Q I G R L 

GCGCGCCTATCTCGACACCGGCCCGGGCGTCGACCGGGCGGGCGTGGCGC 1950 

R A Y G D 7 G ? G V D R A A V A 
AGACACTGGCCCGGCGTACGCACTTCACCCACCGGGCCGGACTGCTCGGG 200 0 
QTLARRTKFTHRAV1LG 
40 'w.-.lAGGGTl-A-TCo^CGCTGCGGC GGCGGA.G'-wA.GGGG'bA.C'jr^.'wTCG i G _ . 2050 

0 T 7 I G A p ? A G Q A C E L V F 
CGTG7AGTGGGGTCAGGGCAGCGAGGATGGCGGGATGGGGGAGCAGCGAG 2100 

V Y s G Q G T Q H P A M G E Q L 
CCGATTCGTCGGTGG7GTTCGCCGAGCGGA7GGCCGAGTG7GCGGCGGCG 2150 

45 AD3SVVFAERMAECAAA 

7TGGGGGAGT7CGTGGACTGGGATG7GTTGAGGGTTC7GGATGATCCGGG 2 200 

LREFVDWDLFTVLDDPA 
GGTGGTGGACCGGGTTGATGTGGTCCAGCCCGCTTCCTGGGCGATGATGG 2 2 50 

V V 0 R V D 7 V Q ? A S W A M K 

50 TTTCCCTGGCCGCGGTGTGGCAGGCGGCCGGTGTGCGGCCGGATGCGGTG 2 300 
V S L A A V W Q A A G 7 R P D A V 
A.TCGG CCA.TTCGCAGGGTG AGATCGCCGCAGCTTGTGTGG GG GGTGCGuT 2 3 o 0 

1 G H S Q G E 1 A A A C V A G A V 
GTCACTACGCGATGCCGCCCGGATCGTGACCTTGCGCAGCCAGGCGATCG 24 0 0 

55 SLRDAARIVTLR3QAI 

CCCGGGGCCTGGCGGGCCGGGGCGCGATGGCATCCGTCGGCCTGCCCGCG 2 4 50 
A R G L A G R G A M A S V A 7 P A 
CAGGATGTCGAGCTGGTCGACGGGGCCTGGATCGCCGCCCACAACGGGCC 2 500 
Q D V E L V D G A W I A A H N G P 

60 CGCCTCCACCGTGATCGCGGGCACCCCGGA.AGCGGTCGACCATGTCCTCA 25 5 0 
A S T V I A G T ? E A V D H V L 
CCGCT C ATG AGGC AC AAG G GG TGCGGGT G C GG CGG. AT CACCGTCG AC TAT 2600 
TAHEAQGVRVRRITVDY 
GCCTCGCACACCCCGCACGTCGAGCTGATCCGCGACGAACTACTCGACA? 2 65 0 
ASHTPKVELIRDSLLDI 
7 3 3 3 S 3 Q T PL 7 P W 1ST 

. >j o / \ - ' j ^ : ; „ ;\ 1 1 1 ^ o ' V 3 G A 1 ,~. -o C CCo^ - 'o G .~Vw ' ~: 'o o 1 C . G'J i ~- C 'v_ o l. 0 L' 

N 1 R 1 ? V G F ri ? A V £ Q 1 C 1 A 

_ w \ ^ o i o _ 'vj .-'.'^ w o a G T i Co _ ^ .* o -j; - '- /-I J ^ o v ^ *r. : ~ - ^ -~ - i ' 'J :Tc^ ^ ^ L 



D T V F 7 E 



10 ^7 ; j 'v.. 'o .—. ^ \j /-v r\ C o i *o i ^ j T^M.^ju i ^; ^ ^ o \- - '• 

Q A X D D D V V T V A T 1 R R D D 
GG:3A3GG3AC3C3GATGCTCACCGCCCTGGCA3A3GCC7ATGTCCACG3 2 95 0 
G 7 A T R M L T A 1 A 1 A V V H G 

15 7 7 7 3 W ? A I 1 G T 7 T T R V 

Tu : or.33TTC3G*-/33TA,CGCCTTl^rA i CrvC7,'-vo^ . ^ u^-^. i ■^■_:-~i. i sj i -w-^ j< ^ d ^ 
L L P T Y F Q H Q R V ° ' £ 

APPA7A3SGHP7LG7G 7 
20 o GC G7CGCCG^ „-7 ^GCCG wGCC'o ; -j: ; oTo7'- Cr.CoG ; j i'w'-'w'oi^^CvGvCy ^ 'J 

A V A G 3PGR7ETG? V ? A 
G I 1 3 3 G 7 A G 7 G 7 G G G G T 3 T T C A. 3 C G 7 C G .»~A C 3 G G 7 I 1 — wj^CoCCG A ^ j, _ J 
G A u R A V F G A E 1 A L A A A D 

25 A T 3 C A T 7 E Q 1 3 V T 5 7 P G 

CGGATCCG CCCGCGG-CAG^GGCCA^CCGC^jw A.ljA.'^-v- : oo ^ i ^ o.~ - vjhA.CC^ j 0 0 

3 3 A R G R A 7 A Q 7 W 7 3 E ? 
73 GG 3 3 AG G G G GG G C G C C G C 7 TC AC CGT C C AC AC C C GCG T C 3 G 7 G ACG C G 3 3 50 
A A 1) G R R R F T 7 H T R V G D A 
30 CGG7GGACGCTGCACG7CGAGGGGGT7CTCCGCCCCGGCCGCGTGCCCCA 34 GO 
P WTLHAEGVLRPGRVPQ 
GC3C3AAGCCGTCGACACCGCCTGGCCCCCGCCGGGCGCGGTGCCCGCGG 34 50 

? F A 7 D T A W P P ? G A V P A 
ACG 3 3 3 TG 7 C 7G GG G GG T GG C GACG C G G G G.AC G AGG 77 7 7 C G T C G AAG C G 3 500 
35 D G L ? G A W R R A D Q V F V E A 

GAAGTCGA,CAGCCCT GACGGCT TCGTGGC AC AC CCCGACCTGCTCGACGG 35 50 

F 7 0 S P D G F 7 A H P C 1 1 D A 
GGTC777TGGGGGG7GGGCGACGGGAGCCGCC.AGCCGAGCGGA7GGCGCG 3600 
V F S A V GDG5RQPT G W R 
40 A 7 777 G 7 G GT G GAG G CGT C G GAG GC C ACCG G 1 GCT GG GCG C C T G G C T C A.CC 3 650 
u L A 7' HAS D A 7 7 IRA C ^ T 
C 3 CC G GG AC AG 7 G 3 T GTC G T GG AGC TCG C C G CCT TC GAC G GT G 7 C G G AAT 3 7 00 

R R 3 S G 7 V E 1 A A F 3 G A G K 
GCCGGTGCTCACCGCGGAGTCGGTGACGCTGGGCGAGGTCGCGTCGGCAG 3 "50 
45 ? 7 1 T A E S V T L G E 7 A S A 

GCGGATCCGACGAGTCGGACGGTCTGCTTCGGCTTGAGTGGTTGCCGGTG 3 3 00 
G G S D E S D G L 1 R 1 E W 1 P V 
GCGGAGGCCCACTACGACGGTGCCGACGAGCTGCCCGAGGGC7ACACCCT 38 50 
AEAHYDGADELPEGYTL 
50 C A 7 C A G C G C C AC A.C AC C C C G ACG AC C C C G AC G AG C C C AC C AAC C CCCACA 3 900 
I T A T H P D D P D 3 P 7 N P H 
AGACAGGCACACGCAGCCACACACA^ACCACACGCGTGCTCACGGCCCTG 3 350 
N 1 7 P T R T H T Q T T R 7 1 T A L 
CAACACCACCTCATCACCACC;i/\CCACACCCTCAT3GTCCACACCACCAC -10 00 
55 Q H H L I T T N H T L I V H T T T 

CGACCCCCCAGGCGCCGCCGTCACCGGCCTCACCCGCACCGCACAAAACG 4 0 50 

DPPGAAVTGLTRTAQK 
AAC ACCCCGGCCGC AT C C AC C T C AT C G AAAC C C AG C ACC C C C AC AC CC C A 4100 
E K P G R I H L I E T H H ? H T P 
60 GTGCGGGTCACCCAAGTCAGCACGGTGCACCA.ACCGCAGCTACGGGTCAC 4 150 
1 P 1 T Q L T T L H Q ? H L R L T 
CAAGrAGACCCTCCACACCCCCCACCTCACCCCCATCACCACCCACCACA 4 200 

NNTLHTPHLTPXTTHH 
ACACCACCACAACCACCCCCAACACCCCACCCCTCAACCCCAACCACGCC 4 2 50 


N T T T T T ? N T ? r G M P V. H A 
T - L I G G S G 7 G A GIG .A R H 

5 G M H p H 7 L G ^ R. ? P P ? ? 




15 H T 7 N 0 ? 



G S ? G G ^ N V A M A 




D R D R : R R G G P G ? I 



Example 3 

Recombinant PKS Genes for 13-desmethoxy FK-506 and FK-520 
The present invention provides a variety of recombinant PKS genes in addition to 
those described in Examples 1 and 2 for producing 13-desmethoxy FK-506 and FK-520 
compounds. This Example provides the construction protocols for recombinant FK-520 
and FK-506 (from Streptomyces sp. MA6858 (ATCC 55098), described in U.S. Patent 
No. 5,116,756, incorporated herein by reference) PKS genes in which the module 8 AT 
coding sequences have been replaced by either the rapAT3 (the AT domain from module 
3 of the rapamycin PKS), rapAT12, eryAT1 (the AT domain from module 1 of the 
erythromycin (DEBS) PKS), or eryAT2 coding sequences. Each of these constructs 
provides a PKS that produces the 13-desmethoxy-13-methyl derivative, except for the 
rapAT12 replacement, which provides the 13-desmethoxy derivative, i.e., it has a 
hydrogen where the other derivatives have methyl. 

Figure 7 shows the process used to generate the AT replacement constructs. 
First, a fragment of ~4.5 kb containing module 8 coding sequences from the FK-520 
cluster of ATCC 14891 was cloned using the convenient restriction sites SacI and SphI 
(Step A in Figure 7). The choice of restriction sites used to clone a 4.0 - 4.5 kb fragment 
comprising module 8 coding sequences from other FK-520 or FK-506 clusters can be 
different depending on the DNA sequence, but the overall scheme is identical. The 
unique SacI and SphI restriction sites at the ends of the FK-520 module 8 fragment were 
then changed to unique BglII and NsiI sites by ligation to synthetic linkers (described in 
the preceding Examples, see Step B of Figure 7). Fragments containing sequences 5' and 
3' of the AT8 sequences were then amplified using primers, described above, that 
introduced either an AvrII site or an NheI site at two different KS/AT boundaries and an 
XhoI site at the AT/DH boundary (Step C of Figure 7). Heterologous AT domains from 
the rapamycin and erythromycin gene clusters were amplified using primers, as 
described above, that introduced the same sites as just described (Step D of Figure 7). 
The fragments were ligated to give hybrid modules with in-frame fusions at the KS/AT 
and AT/DH boundaries (Step E of Figure 7). Finally, these hybrid modules were ligated 
into the BamHI and PstI sites of the KC515 vector. The resulting recombinant phage 
were used to transform the FK-506 and FK-520 producer strains to yield the desired 
recombinant cells, as described in the preceding Examples. 
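
The scheme above depends on two properties of the restriction enzymes involved: each engineered site has to be locatable in a fragment, and the BglII and NsiI ends of the hybrid module have to be ligatable into the BamHI and PstI sites of the vector. The following Python sketch illustrates both checks. The recognition sequences and overhangs are the commonly published ones for these enzymes; the function names and the short demonstration fragment are hypothetical and are not taken from this specification.

```python
# Recognition sequence (5'->3'), overhang sequence, and overhang polarity.
# These are the commonly published values; verify against a current
# enzyme reference before relying on them.
ENZYMES = {
    "AvrII": ("CCTAGG", "CTAG", "5'"),
    "NheI":  ("GCTAGC", "CTAG", "5'"),
    "XhoI":  ("CTCGAG", "TCGA", "5'"),
    "BglII": ("AGATCT", "GATC", "5'"),
    "BamHI": ("GGATCC", "GATC", "5'"),
    "NsiI":  ("ATGCAT", "TGCA", "3'"),
    "PstI":  ("CTGCAG", "TGCA", "3'"),
}


def find_sites(sequence, enzyme):
    """Return 0-based start positions of every recognition site for `enzyme`."""
    site = ENZYMES[enzyme][0]
    seq = sequence.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions


def compatible_ends(enzyme_a, enzyme_b):
    """True if the two enzymes leave identical overhangs (same bases, same polarity)."""
    return ENZYMES[enzyme_a][1:] == ENZYMES[enzyme_b][1:]


# Hypothetical 48-bp fragment laid out like a hybrid module:
# BglII ... AvrII ... NheI ... XhoI ... NsiI
DEMO_FRAGMENT = (
    "AGATCT" "GGCCGT" "CCTAGG" "CGTGCG"
    "GCTAGC" "TACTGG" "CTCGAG" "ATGCAT"
)

if __name__ == "__main__":
    for name in ("BglII", "AvrII", "NheI", "XhoI", "NsiI"):
        print(name, find_sites(DEMO_FRAGMENT, name))
    print("BglII into BamHI:", compatible_ends("BglII", "BamHI"))
    print("NsiI into PstI:", compatible_ends("NsiI", "PstI"))
```

Under these assumptions, a BglII/NsiI fragment carries the same overhangs as a BamHI/PstI-cut vector, which is consistent with the ligation step described above.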

The following table shows the location and sequences surrounding the engineered 
site of each of the heterologous AT domains employed. The FK-506 hybrid construct 
was used as a control for the FK-520 recombinant cells produced, and a similar FK-520 
hybrid construct was used as a control for the FK-506 recombinant cells. 



Heterologous AT      Enzyme   Location of Engineered Site 

FK-506 AT8           AvrII    G R P R R A A V S S F 
(hydroxymalonyl)     NheI     C t\ ^ ^ .*-. . -~ o ^ 0 ~. . <~ ' j> o - w J-\ GU'oQCCCQCC 
                              T Q H P A M G E R L A 
                     XhoI     YA^GGG. - w ...-.oCooL G'sjCC ACTCjGci t: cqs ct 

rapamycin AT3        AvrII    D R r ? R A G V S S F 
(methylmalonyl)      NheI     TGGGAGTG^GGGGGGAY^GGGAGTGGcctacqG 
                              W Q W L G X G S A L R 
                     XhoI     TAG GZCZ^TZ Z AA GAG G A. G G G G T AC T G G a t c c a a 
                              Y A F Q K C R Y W V E 

rapamycin AT12       AvrII    GGCGGA. 3 c 3 c e Go-j-^ ^ GT'o TCGT GGTTC 
(malonyl)                     G R A R A G V S S F 
                     NheI     S Q K A G X G E E L A 
                     XhoI     T A.G i j3 G G G i lAo ^ w :^G l ± AC TCGciCQaC 
                              Y A F Q H Q R Y W L E 

DEBS AT1             AvrII    G G G G G A. cccj i' ci c G GG ^GGGGGGTGTCGTCGTTC 
(methylmalonyl)               A R ? R R A G V S S F 
                     NheI     TGGCAGTGGGGGGGGATGGCGGTCGAce t act C 
                              W Q W A G M A V DLL 
                     XhoI     T AC GC G G T C C AGCGCG A.GCG CGTCTGG c t c a a a 
                              Y F F Q R £ R V W L E 

DEBS AT2             AvrII    GACGGGqtccacCGGGCAGGTGTGTCGGCGTTC 
(methylmalonyl)               D G V R R A G V S A F 
                     NheI     GC C G AG T G G G AAG GC AT GG C G C G G G Aa 1 1 a 1 1 G 
                              A Q W E G M A R E L L 
                     XhoI     TATCCTTTCCAGGGCAAGCGGTTCTGGc-qcta 
                              Y P F Q G K R F W L L 
The sequences shown below provide the location of the KS/AT boundaries 
chosen in the FK-520 module 8 coding sequences. Regions where AvrII and NheI sites 
were engineered are indicated by lower case and underlining. 









T 


P A 


P 1- 


- G'vj 

V; 


r P 


T P 


v_ 'o w 

R 


;:cacggC 
P R 




gtgcccc 






GTGAGC"! 


;GCACC 




!GCC CA 




CT P 


n ^ r 1 ^ / ~" 


R 


\ 3 


J) 


F G 


v s 


G T 


u 


A H 


V I 


P 


P A 


GACCGG7A 


AGGGAGA 






3GATGGC 


:gttpg 


GGT 


'GA/C P7 


_ i _ '. G 'o 


CT G 


'o I L:TC\jG 




T E 


T 


? A 


A S 


P s 


G 


D P 


? P 




v s 


CACGCTCA 


CGGGAAG 








:TGGG G 






GGCCTAG 


CTG 


GACACCA 


A R S 


P E 




L D 


E Q 


I R 


R 


P R 


a y 


L 


D T 


CCCGGGAC 


'jTvu .*-\ w G 


■o ^ 


: r G G P 


v; - '.I'vjCrv.. 




GGG 


iGCCCG 


GG<oGAGA 




TTCGCCC 




V D 


R 


V A 




Q 7 




A R 


P T 


H 


P A 








? G T G A G 


r, „ U o j. w / 












CCCGACG 








G D 








? p 


A P 


P 


P D 


AACTCGCC 


T7CGTCT 




'CCGGC 














CAa::cq 


E L V 


F V 




S G 


Q G 








V n 


P 


P L 


cCGCCGGG 


CATGGGG 




TCGCC 


GAGGCGT 


'GvjGA.T 




.GCGCT 


i « \_- 'o ^ '-^ 


-j, r- 


GACAACC 


A A A 


F P 


V 


r n 


D A 


r*7 '„ ; 


IT 


n ^ 


R R 


T 


P N 



The sequences shown below provide the location of the AT/DH boundary chosen 
in the FK-520 module 8 coding sequences. The region where an XhoI site was 
engineered is indicated by lower case and underlining. 

TCCCCGGGG CTGGGTC ACGGC ACGAiCGCGGAToTGCCCG'-G T ACoCb T GC AAGGGCGljv. 
I L G A G S R H D A D V ? A Y A F Q R R 
ACTACTGGa t eg ag TCGGCACGCCCGGCCGC AT GGG ACGCGGGCC ACCCCGTGCTGGGCT 
H Y W ~~ T" " E ' SARFAASDAGKPVLG 

The sequences shown below provide the location of the KS/AT boundaries 
chosen in the FK-506 module 8 coding sequences. Regions where AvrII and NheI sites 
were engineered are indicated by lower case and underlining. 

TCGGCCAGGCCGTGGCCGCGGACCGGCCGTcco-cacCGTGCGGCGGTCTCGTCGTTCGGG 

S A R P W P R T G R P R R A A V S S F G 
GTGAGCGGCACCAACGCCCACATCATCCTGGAGGCCGGACCCGACCAGGAGGAGCCGTCG 

VSGTNAHIILEAGPDQEEPS 
GCAGAACCGGCCGGTGACCTCCCGCTGCTCGTGTCGGCACGGTCCCCGGAGGCACTGGAC 

A E P A G D L P L L V S A R S P E A L D 
G AGCriG ATCGGGCGCCT GCGCGAC T AT CTCGACGCCGCCCCC GGCGTGij AC CTGuCuG^C 
EQIGRLRDYLDAAPGVDLAA 
GTGGCGCGGACACTGGCCACGCGTACGCACTTCTCCCACCGCGCCGTACTGCTCGGTGAC 

V A R T L A T R T H F S H R A V L L G D 
ACCGTCATCACCGCTCCCCCCGTGGAACAGCCGGGCGAGCTCGTCTTCGTCTACTCGGGA 

TVITAPPVEQPGELVFVYSG 
CAGGGCACCCAGCATCCCGCGATGGGTGAGCGocrcccCGCAGCCTTCCCCGTGTTCGCC 

QGTQH PAMGERLAAAFPVFA 
GACCCGGACGTACCCGCCTACGCCTTCCAGCGGCGGCCCTACTGGATCGAGTCCGCGCCG 

DPDVPAYAFQRRPYWIESAP 

The sequences shown below provide the location of the AT/DH boundary chosen 
in the FK-506 module 8 coding sequences. The region where an XhoI site was 
engineered is indicated by lower case and underlining. 

GACCCGGACGTACCCGCCTACGCCTTCCAGCGGCGGCCCTACTGG atcgag TCCGCGCCG 
DPDVPAYAFQRRPYWIESAP 
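
The pairing of each DNA line with its peptide line can be checked by translating the DNA in frame. The Python sketch below does this for the FK-506 AT/DH boundary sequence shown above, using the standard genetic code. It also illustrates how a single base change in the lower-case region (atcgag to ctcgag) would create an XhoI recognition site while changing isoleucine to leucine; that specific change is an assumption inferred from the "W L E" peptides in the table of engineered sites, not a primer sequence stated in this specification.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1; case and spaces are ignored."""
    seq = dna.upper().replace(" ", "")
    usable = len(seq) - len(seq) % 3  # drop any incomplete trailing codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# FK-506 module 8 AT/DH boundary as printed above (lower case marks the site).
NATIVE = "GACCCGGACGTACCCGCCTACGCCTTCCAGCGGCGGCCCTACTGGatcgagTCCGCGCCG"

# Assumed engineered form: one base change creates the XhoI site CTCGAG.
ENGINEERED = NATIVE.replace("atcgag", "ctcgag")

if __name__ == "__main__":
    print(translate(NATIVE))      # DPDVPAYAFQRRPYWIESAP
    print(translate(ENGINEERED))  # DPDVPAYAFQRRPYWLESAP
    print("CTCGAG" in ENGINEERED.upper())
```

The same function can be used to cross-check any legible DNA line in this Example against the peptide printed beneath it.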
Example 4 

Replacement of Methoxyl with Hydrogen or Methyl at C-15 of FK-506 and FK-520 
The methods and reagents of the present invention also provide novel FK-506 
and FK-520 derivatives in which the methoxy group at C-15 is replaced by a hydrogen or 
methyl. These derivatives are produced in recombinant host cells of the invention that 
express recombinant PKS enzymes that produce the derivatives. These recombinant PKS 
enzymes are prepared in accordance with the methodology of Examples 1 and 2, with the 
exception that the AT domain of module 7, instead of module 8, is replaced. Moreover, the 
present invention provides recombinant PKS enzymes in which the AT domains of both 
modules 7 and 8 have been changed. The table below summarizes the various 
compounds provided by the present invention. 





Compound   C-13       C-15       Derivative Provided 

FK-506     hydrogen   hydrogen   13,15-didesmethoxy-FK-506 
FK-506     hydrogen   methoxy    13-desmethoxy-FK-506 
FK-506     hydrogen   methyl     13,15-didesmethoxy-15-methyl-FK-506 
FK-506     methoxy    hydrogen   15-desmethoxy-FK-506 
FK-506     methoxy    methoxy    Original Compound - FK-506 
FK-506     methoxy    methyl     15-desmethoxy-15-methyl-FK-506 
FK-506     methyl     hydrogen   13,15-didesmethoxy-13-methyl-FK-506 
FK-506     methyl     methoxy    13-desmethoxy-13-methyl-FK-506 
FK-506     methyl     methyl     13,15-didesmethoxy-13,15-dimethyl-FK-506 
FK-520     hydrogen   hydrogen   13,15-didesmethoxy-FK-520 
FK-520     hydrogen   methoxy    13-desmethoxy-FK-520 
FK-520     hydrogen   methyl     13,15-didesmethoxy-15-methyl-FK-520 
FK-520     methoxy    hydrogen   15-desmethoxy-FK-520 
FK-520     methoxy    methoxy    Original Compound - FK-520 
FK-520     methoxy    methyl     15-desmethoxy-15-methyl-FK-520 
FK-520     methyl     hydrogen   13,15-didesmethoxy-13-methyl-FK-520 
FK-520     methyl     methoxy    13-desmethoxy-13-methyl-FK-520 
FK-520     methyl     methyl     13,15-didesmethoxy-13,15-dimethyl-FK-520 
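
The compound table follows a regular pattern: every combination of hydrogen, methoxy, or methyl at C-13 and C-15 is listed, and the derivative name is built from the positions that lost a methoxy group and the positions that gained a methyl group. The Python sketch below regenerates the nine entries for each parent compound under that naming pattern. The function names are hypothetical, and the naming rule is inferred from the table rather than stated in the text.

```python
from itertools import product

# Substituents considered at C-13 and C-15 in this Example.
SUBSTITUENTS = ("hydrogen", "methoxy", "methyl")


def derivative_name(parent, c13, c15):
    """Build the derivative name used in the table for one C-13/C-15 combination."""
    positions = ((13, c13), (15, c15))
    replaced = [str(pos) for pos, sub in positions if sub != "methoxy"]
    methyls = [str(pos) for pos, sub in positions if sub == "methyl"]
    if not replaced:
        return f"Original Compound - {parent}"
    name = ",".join(replaced)
    name += "-didesmethoxy" if len(replaced) == 2 else "-desmethoxy"
    if methyls:
        name += "-" + ",".join(methyls)
        name += "-dimethyl" if len(methyls) == 2 else "-methyl"
    return f"{name}-{parent}"


def enumerate_derivatives(parent):
    """Return (C-13, C-15, name) for all nine substituent combinations."""
    return [
        (c13, c15, derivative_name(parent, c13, c15))
        for c13, c15 in product(SUBSTITUENTS, repeat=2)
    ]


if __name__ == "__main__":
    for parent in ("FK-506", "FK-520"):
        for c13, c15, name in enumerate_derivatives(parent):
            print(f"{parent:8} {c13:10} {c15:10} {name}")
```

The enumeration order produced by `itertools.product` matches the row order of the table, which makes a line-by-line comparison straightforward.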



Example 5 

Replacement of Methoxyl with Ethyl at C-13 and/or C-15 of FK-506 and FK-520 
The present invention also provides novel FK-506 and FK-520 derivative 
compounds in which the methoxy groups at either or both the C-13 and C-15 positions 
are instead ethyl groups. These compounds are produced by novel PKS enzymes of the 
invention in which the AT domains of modules 8 and/or 7 are converted to ethylmalonyl 
specific AT domains by modification of the PKS gene that encodes the module. 
Ethylmalonyl specific AT domain coding sequences can be obtained from, for example, 
the FK-520 PKS genes, the niddamycin PKS genes, and the tylosin PKS genes. The 
novel PKS genes of the invention include not only those in which either or both of the 
AT domains of modules 7 and 8 have been converted to ethylmalonyl specific AT 
domains but also those in which one of the modules is converted to an ethylmalonyl 
specific AT domain and the other is converted to a malonyl specific or a methylmalonyl 
specific AT domain. 
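
The relationship between AT domain specificity and the substituent that appears at C-13 or C-15 can be summarized as a simple lookup. The Python sketch below encodes the correspondences described in Examples 3 to 5 (module 8 controls C-13, module 7 controls C-15) and lists the combinations that place an ethyl group at one or both positions. The dictionary and function names are hypothetical, and the hydroxymalonyl-to-methoxy entry is an assumption that treats the native methoxy group as the outcome of the unmodified AT domain.

```python
from itertools import product

# Substituent expected at the controlled carbon for each AT specificity.
# The hydroxymalonyl -> methoxy entry assumes the native, unmodified pathway.
SUBSTITUENT_BY_AT = {
    "hydroxymalonyl": "methoxy",
    "malonyl": "hydrogen",
    "methylmalonyl": "methyl",
    "ethylmalonyl": "ethyl",
}


def predicted_substituents(at_module_7, at_module_8):
    """Map the AT specificities of modules 7 and 8 to C-15 and C-13 substituents."""
    return {
        "C-13": SUBSTITUENT_BY_AT[at_module_8],
        "C-15": SUBSTITUENT_BY_AT[at_module_7],
    }


def ethyl_containing_combinations():
    """All module 7 / module 8 AT pairings that give ethyl at C-13 and/or C-15."""
    return [
        (at7, at8)
        for at7, at8 in product(SUBSTITUENT_BY_AT, repeat=2)
        if "ethylmalonyl" in (at7, at8)
    ]


if __name__ == "__main__":
    for at7, at8 in ethyl_containing_combinations():
        print(at7, at8, predicted_substituents(at7, at8))
```

Of the sixteen possible pairings of the four specificities, seven include at least one ethylmalonyl specific AT domain, which corresponds to the set of constructs contemplated in this Example.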

Example 6 
Neurotrophic Compounds 

The compounds described in Examples 1 - 4, inclusive, have immunosuppressant 
activity and can be employed as immunosuppressants in a manner and in formulations 
similar to those employed for FK-506. The compounds of the invention are generally 
effective for the prevention of organ rejection in patients receiving organ transplants and 
in particular can be used for immunosuppression following orthotopic liver 
transplantation. These compounds also have pharmacokinetic properties and metabolism 
that are more advantageous for certain applications relative to those of FK-506 or 
FK-520. These compounds are also neurotrophic; however, for use as neurotrophic 
agents, it is desirable to modify the compounds to diminish or abolish their 
immunosuppressant activity. This can be readily accomplished by hydroxylating the 
compounds at the C-18 position using established chemical methodology or novel 
FK-520 PKS genes provided by the present invention. 

Thus, in one aspect, the present invention provides a method for stimulating 
nerve growth that comprises administering a therapeutically effective dose of 
18-hydroxy-FK-520. In another embodiment, the compound administered is a 
C-18,20-dihydroxy-FK-520 derivative. In another embodiment, the compound administered 
is a C-13-desmethoxy and/or C-15-desmethoxy 18-hydroxy-FK-520 derivative. In another 
embodiment, the compound administered is a C-13-desmethoxy and/or 
C-15-desmethoxy 18,20-dihydroxy-FK-520 derivative. In other embodiments, the compounds 
are the corresponding analogs of FK-506. The 18-hydroxy compounds of the invention 
can be prepared chemically, as described in U.S. Patent No. 5,189,042, incorporated 
herein by reference, or by fermentation of a recombinant host cell provided by the 
present invention that expresses a recombinant PKS in which the module 5 DH domain 
has been deleted or rendered non-functional. 

The chemical methodology is as follows. A compound of the invention (~200 
mg) is dissolved in 3 mL of dry methylene chloride and added to 45 µL of 2,6-lutidine, 
and the mixture stirred at room temperature. After 10 minutes, tert-butyldimethylsilyl 
trifluoromethanesulfonate (64 µL) is added by syringe. After 15 minutes, the reaction 
mixture is diluted with ethyl acetate, washed with saturated bicarbonate, washed with 
brine, and the organic phase dried over magnesium sulfate. Removal of solvent in vacuo 
and flash chromatography on silica gel (ethyl acetate:hexane (1:2) plus 1% methanol) 
gives the protected compound, which is dissolved in 95% ethanol (2.2 mL) and to which 
is added 53 µL of pyridine, followed by selenium dioxide (58 mg). The flask is fitted 
with a water condenser and heated to 70°C on a mantle. After 20 hours, the mixture is 
cooled to room temperature, filtered through diatomaceous earth, and the filtrate poured 
into a saturated sodium bicarbonate solution. This is extracted with ethyl acetate, and the 
organic phase is washed with brine and dried over magnesium sulfate. The solution is 
concentrated and purified by flash chromatography on silica gel (ethyl acetate:hexane 
(1:2) plus 1% methanol) to give the protected 18-hydroxy compound. This compound is 
dissolved in acetonitrile and treated with aqueous HF to remove the protecting groups. 
After dilution with ethyl acetate, the mixture is washed with saturated bicarbonate and 
brine, dried over magnesium sulfate, filtered, and evaporated to yield the 18-hydroxy 
compound. Thus, the present invention provides the C-18-hydroxyl derivatives of the 
compounds described in Examples 1 - 4. 

Those of skill in the art will recognize that other suitable chemical procedures can 
be used to prepare the novel 18-hydroxy compounds of the invention. See, e.g., Kawai et 
al., Jan. 1993, Structure-activity profiles of macrolactam immunosuppressant FK-506 
analogues, FEBS Letters 316(2): 107-113, incorporated herein by reference. These 
methods can be used to prepare both the C18-[S]-OH and C18-[R]-OH enantiomers, with 
the R enantiomer showing a somewhat lower IC50, which may be preferred in some 
applications. See Kawai et al., supra. Another preferred protocol is described in Umbreit 
and Sharpless, 1977, JACS 99(16): 5526-28, although it may be preferable to use 30 
equivalents each of SeO2 and t-BuOOH rather than the 0.02 and 3-4 equivalents, 
respectively, described in that reference. 
All scientific and patent publications referenced herein are hereby incorporated 
by reference. The invention having now been described by way of written description 
and example, those of skill in the art will recognize that the invention can be practiced in 
a variety of embodiments, and that the foregoing description and example are for purposes 
of illustration and not limitation of the following claims. 
Claims 

1. An isolated nucleic acid that encodes a CoA ligase, a non-ribosomal peptide 
synthetase, or a domain of an extender module of a polyketide synthase enzyme that 
synthesizes FK-520. 

2. The isolated nucleic acid of claim 1 that encodes an extender module, said 
module comprising a ketosynthase domain, an acyl transferase domain, and an acyl 
carrier protein domain. 

3. The isolated nucleic acid of claim 1 that encodes an open reading frame, said 
open reading frame comprising coding sequences for two or more extender modules, 
each extender module comprising a ketosynthase domain, an acyl transferase domain, 
and an acyl carrier protein domain. 

4. The isolated nucleic acid of claim 1 that encodes a gene cluster, said gene 
cluster comprising two or more open reading frames, each of said open reading frames 
comprising coding sequences for two or more extender modules, each of said extender 
modules comprising a ketosynthase domain, an acyl transferase domain, and an acyl 
carrier protein domain. 

5. The isolated nucleic acid of claim 2, wherein at least one of said domains is a 
domain of a module of a non-FK-520 polyketide synthase. 

6. The isolated nucleic acid of claim 1, wherein said nucleic acid is a 
recombinant vector capable of replication in or integration into the chromosome of a host 
cell. 

7. The isolated nucleic acid of claim 6 that is selected from the group consisting 
of cosmid pKOS034-120, cosmid pKOS034-124, cosmid pKOS065-M27, and cosmid 
pKOS065-M21. 

8. The isolated nucleic acid of claim 5, wherein said non-FK-520 polyketide 
synthase is rapamycin polyketide synthase, FK-506 polyketide synthase, or erythromycin 
polyketide synthase. 

9. A method of preparing a polyketide, said method comprising transforming a 
host cell with a recombinant DNA vector of claim 6, and culturing said host cell under 
conditions such that said polyketide synthase is produced and catalyzes synthesis of said 
polyketide. 

10. The method of claim 9, wherein said host cell is a Streptomyces host cell. 

11. The method of claim 9, wherein said polyketide is selected from the group 
consisting of FK-520, 13-desmethoxy-FK-520, and 13-desmethoxy-FK-506. 

12. A recombinant host cell that expresses a recombinant polyketide synthase 
selected from the group consisting of: (i) an FK-520 polyketide synthase in which at 
least one AT domain is replaced by an AT domain of a non-FK-520 polyketide synthase; 
(ii) an FK-506 polyketide synthase in which at least one AT domain is replaced by an 
AT domain of a non-FK-506 polyketide synthase; (iii) an FK-520 polyketide synthase in 
which at least one DH domain has been deleted; (iv) an FK-506 polyketide synthase in 
which at least one DH domain has been deleted. 

13. The recombinant host cell of claim 12 that expresses an FK-520 polyketide 
synthase in which an AT domain of module 8 has been replaced by an AT domain that 
binds malonyl CoA, methylmalonyl CoA, or ethylmalonyl CoA. 

14. The recombinant host cell of claim 12 that expresses an FK-506 polyketide 
synthase in which an AT domain of module 8 has been replaced by an AT domain that 
binds malonyl CoA, methylmalonyl CoA, or ethylmalonyl CoA. 

15. The recombinant host cell of claim 13, wherein a DH domain of module 5 or 
module 6 has been deleted. 

16. The recombinant host cell of claim 14, wherein a DH domain of module 5 or 
module 6 has been deleted. 

17. A recombinant host cell that comprises recombinant genes coding for 
enzymes sufficient for synthesis of ethylmalonyl CoA or 2-hydroxymalonyl CoA. 
wherein R1 is hydrogen, methyl, ethyl, or allyl; R2 is hydrogen or hydroxyl, provided 
that when R2 is hydrogen, there is a double bond between C-20 and C-19; R3 is hydrogen 
or hydroxyl; R4 is methoxyl, hydrogen, methyl, or ethyl; and R5 is methoxyl, hydrogen, 
methyl, or ethyl; but not including FK-506, FK-520, 18-hydroxy-FK-520, and 
18-hydroxy-FK-506. 

19. The polyketide of claim 18 that is 13-desmethoxy-FK-506. 

20. The polyketide of claim 18 that is 13-desmethoxy-18-hydroxy-FK-520. 
POLYKETIDE SYNTHASE ENZYMES AND RECOMBINANT DNA CONSTRUCTS 
THEREFOR 

Field of the Invention 
The present invention relates to polyketides and the polyketide synthase (PKS) 
enzymes that produce them. The invention also relates generally to genes encoding PKS 
enzymes and to recombinant host cells containing such genes and in which expression of 
such genes leads to the production of polyketides. The present invention also relates to 
compounds useful as medicaments having immunosuppressive and/or neurotrophic activity. 
Thus, the invention relates to the fields of chemistry, molecular biology, and agricultural, 
medical, and veterinary technology. 

Background of the Invention 

Polyketides are a class of compounds synthesized from 2-carbon units through a 
series of condensations and subsequent modifications. Polyketides occur in many types of 
organisms, including fungi and mycelial bacteria, in particular, the actinomycetes. 
Polyketides are biologically active molecules with a wide variety of structures, and the class 
encompasses numerous compounds with diverse activities. Tetracycline, erythromycin, 
epothilone, FK-506, FK-520, narbomycin, picromycin, rapamycin, spinocyn, and tylosin are 
examples of polyketides. Given the difficulty in producing polyketide compounds by 
traditional chemical methodology, and the typically low production of polyketides in 
wild-type cells, there has been considerable interest in finding improved or alternate means 
to produce polyketide compounds. 

This interest has resulted in the cloning, analysis, and manipulation by recombinant 
DNA technology of genes that encode PKS enzymes. The resulting technology allows one 
to manipulate a known PKS gene cluster either to produce the polyketide synthesized by 
that PKS at higher levels than occur in nature or in hosts that otherwise do not produce the 
polyketide. The technology also allows one to produce molecules that are structurally 
related to, but distinct from, the polyketides produced from known PKS gene clusters. See, 
e.g., PCT publication Nos. WO 93/13663; 95/08548; 96/40968; 97/02358; 98/27203; and 
98/49315; United States Patent Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 
5,672,491; 5,712,146; 5,830,750; and 5,843,718; and Fu et al., 1994, Biochemistry 33: 
9321-9326; McDaniel et al., 1993, Science 262: 1546-1550; and Rohr, 1995, Angew. Chem. 
Int. Ed. Engl. 34(8): 881-888, each of which is incorporated herein by reference. 

Polyketides are synthesized in nature by PKS enzymes. These enzymes, which are 
complexes of multiple large proteins, are similar to the synthases that catalyze condensation 
of 2-carbon units in the biosynthesis of fatty acids. PKSs catalyze the biosynthesis of 
polyketides through repeated, decarboxylative Claisen condensations between acylthioester 
building blocks. The building blocks used to form complex polyketides are typically 
acylthioesters, such as acetyl, butyryl, propionyl, malonyl, hydroxymalonyl, 
methylmalonyl, and ethylmalonyl CoA. Other building blocks include amino acid-like 
acylthioesters. PKS enzymes that incorporate such building blocks include an activity that 
functions as an amino acid ligase (an AMP ligase) or as a non-ribosomal peptide synthetase 
(NRPS). Two major types of PKS enzymes are known; these differ in their composition and 
mode of synthesis of the polyketide synthesized. These two major types of PKS enzymes 
are commonly referred to as Type I or "modular" and Type II "iterative" PKS enzymes. 

In the Type I or modular PKS enzyme group, a set of separate catalytic active sites 
(each active site is termed a "domain", and a set thereof is termed a "module") exists for 
each cycle of carbon chain elongation and modification in the polyketide synthesis pathway. 
The typical modular PKS is composed of several large polypeptides, which can be 
segregated from amino to carboxy termini into a loading module, multiple extender 
modules, and a releasing (or thioesterase) domain. The PKS enzyme known as 
6-deoxyerythronolide B synthase (DEBS) is a Type I PKS. In DEBS, there is a loading 
module, six extender modules, and a thioesterase (TE) domain. The loading module, six 
extender modules, and TE of DEBS are present on three separate proteins (designated 
DEBS-1, DEBS-2, and DEBS-3, with two extender modules per protein). Each of the 
DEBS polypeptides is encoded by a separate open reading frame (ORF) or gene; these 
genes are known as eryAI, eryAII, and eryAIII. See Caffrey et al., 1992, FEBS Letters 304: 
205, and U.S. Patent No. 5,824,513, each of which is incorporated herein by reference. 

Generally, the loading module is responsible for binding the first building block 
used to synthesize the polyketide and transferring it to the first extender module. The 
loading module of DEBS consists of an acyltransferase (AT) domain and an acyl carrier 
protein (ACP) domain. Another type of loading module utilizes an inactivated ketosynthase 
(KS) domain and AT and ACP domains. This inactivated KS is in some instances called 
KS^Q, where the superscript letter is the abbreviation for the amino acid, glutamine, that is 
present instead of the active site cysteine required for ketosynthase activity. In other PKS 
enzymes, including the FK-506 PKS, the loading module incorporates an unusual starter 
unit and is composed of a CoA ligase-like activity domain. In any event, the loading module 
recognizes a particular acyl-CoA (usually acetyl or propionyl but sometimes butyryl or 
other acyl-CoA) and transfers it as a thiol ester to the ACP of the loading module. 

The AT on each of the extender modules recognizes a particular extender-CoA 
(malonyl or alpha-substituted malonyl, i.e., methylmalonyl, ethylmalonyl, and 
2-hydroxymalonyl) and transfers it to the ACP of that extender module to form a thioester. 
Each extender module is responsible for accepting a compound from a prior module, 
binding a building block, attaching the building block to the compound from the prior 
module, optionally performing one or more additional functions, and transferring the 
resulting compound to the next module. 

Each extender module of a modular PKS contains a KS, AT, ACP, and zero, one,
two, or three domains that modify the beta-carbon of the growing polyketide chain. A
typical (non-loading) minimal Type I PKS extender module is exemplified by extender
module three of DEBS, which contains a KS domain, an AT domain, and an ACP domain.
These three domains are sufficient to activate a 2-carbon extender unit and attach it to the
growing polyketide molecule. The next extender module, in turn, is responsible for
attaching the next building block and transferring the growing compound to the next
extender module until synthesis is complete.

Once the PKS is primed with acyl- and malonyl-ACPs, the acyl group of the loading
module is transferred to form a thiol ester (trans-esterification) at the KS of the first
extender module; at this stage, extender module one possesses an acyl-KS and a malonyl (or
substituted malonyl) ACP. The acyl group derived from the loading module is then
covalently attached to the alpha-carbon of the malonyl group to form a carbon-carbon bond,
driven by concomitant decarboxylation, and generating a new acyl-ACP that has a backbone
two carbons longer than the loading building block (elongation or extension).

The polyketide chain, growing by two carbons at each extender module, is sequentially
passed as covalently bound thiol esters from extender module to extender module, in an
assembly line-like process. The carbon chain produced by this process alone would possess
a ketone at every other carbon atom, producing a polyketone, from which the name
polyketide arises. Most commonly, however, additional enzymatic activities modify the beta
keto group of each two-carbon unit just after it has been added to the growing polyketide
chain but before it is transferred to the next module.
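
The two-carbons-per-module bookkeeping described above can be illustrated with a short
Python sketch. It is offered only as an illustration and is not part of the disclosed
methods. The function name and the starter-unit table are hypothetical, and the count
covers backbone carbons only (side-chain carbons contributed by substituted malonyl
extender units are deliberately ignored).

```python
# Illustrative sketch only: names and table below are assumptions, not from the source.
# Backbone carbons contributed by common starter units (acyl groups of the acyl-CoA).
STARTER_BACKBONE_CARBONS = {"acetyl": 2, "propionyl": 3, "butyryl": 4}


def backbone_carbons(starter: str, n_extender_modules: int) -> int:
    """Return the backbone length after all extender modules have acted.

    Each extender module adds exactly two backbone carbons; the third carbon
    of the malonyl unit is lost as CO2 during the condensation.
    """
    if n_extender_modules < 0:
        raise ValueError("number of extender modules cannot be negative")
    return STARTER_BACKBONE_CARBONS[starter] + 2 * n_extender_modules


# A propionyl starter followed by six extender modules (the DEBS arrangement,
# assuming a propionyl starter) gives a 15-carbon backbone.
print(backbone_carbons("propionyl", 6))  # 15
```

The same arithmetic applies to any modular PKS: only the starter unit and the number of
extender modules change.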

Thus, in addition to the minimal module containing KS, AT, and ACP domains
necessary to form the carbon-carbon bond, and as noted above, other domains that modify
the beta-carbonyl moiety can be present. Thus, modules may contain a ketoreductase (KR)
domain that reduces the keto group to an alcohol. Modules may also contain a KR domain
plus a dehydratase (DH) domain that dehydrates the alcohol to a double bond. Modules may
also contain a KR domain, a DH domain, and an enoylreductase (ER) domain that converts
the double bond product to a saturated single bond using the beta carbon as a methylene
function. An extender module can also contain other enzymatic activities, such as, for
example, a methylase or dimethylase activity.
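
The relationship between the beta-keto processing domains in a module and the
functionality left at the beta-carbon can be summarized in a small lookup. The Python
sketch below is illustrative only; the function name is hypothetical, and it assumes that
every listed domain is catalytically active and that DH acts only when KR is present and
ER only when DH is present.

```python
# Illustrative sketch only: function name and return labels are assumptions.
def beta_carbon_state(domains) -> str:
    """Map the set of domains in an extender module to the beta-carbon outcome.

    Assumes all listed domains are functional; an inactive domain should be
    left out of the input.
    """
    present = set(domains)
    if "KR" not in present:
        return "ketone"        # minimal module: keto group left unreduced
    if "DH" not in present:
        return "hydroxyl"      # KR only: keto group reduced to an alcohol
    if "ER" not in present:
        return "double bond"   # KR + DH: alcohol dehydrated
    return "methylene"         # KR + DH + ER: fully saturated


for module in (["KS", "AT", "ACP"],
               ["KS", "AT", "KR", "ACP"],
               ["KS", "AT", "DH", "KR", "ACP"],
               ["KS", "AT", "DH", "ER", "KR", "ACP"]):
    print(module, "->", beta_carbon_state(module))
```

A module carrying a non-functional domain would be modeled here by omitting that domain
from the list.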

After traversing the final extender module, the polyketide encounters a releasing
domain that cleaves the polyketide from the PKS and typically cyclizes the polyketide. For
example, final synthesis of 6-dEB is regulated by a TE domain located at the end of
extender module six. In the synthesis of 6-dEB, the TE domain catalyzes cyclization of the
macrolide ring by formation of an ester linkage. In FK-506, FK-520, rapamycin, and similar
polyketides, the TE activity is replaced by a RapP (for rapamycin) or RapP-like activity that
makes a linkage incorporating a pipecolic acid residue. The enzymatic activity that
catalyzes this incorporation for the rapamycin enzyme is known as RapP, encoded by the
rapP gene. The polyketide can be modified further by tailoring enzymes; these enzymes add
carbohydrate groups or methyl groups, or make other modifications, i.e., oxidation or
reduction, on the polyketide core molecule. For example, 6-dEB is hydroxylated at C-6 and
C-12 and glycosylated at C-3 and C-5 in the synthesis of erythromycin A.

In Type I PKS polypeptides, the order of catalytic domains is conserved. When all
beta-keto processing domains are present in a module, the order of domains in that module
from N-to-C-terminus is always KS, AT, DH, ER, KR, and ACP. Some or all of the
beta-keto processing domains may be missing in particular modules, but the order of the
domains present in a module remains the same. The order of domains within modules is
believed to be important for proper folding of the PKS polypeptides into an active complex.
Importantly, there is considerable flexibility in PKS enzymes, which allows for the genetic
engineering of novel catalytic complexes. The engineering of these enzymes is achieved by
modifying, adding, or deleting domains, or replacing them with those taken from other
Type I PKS enzymes. It is also achieved by deleting, replacing, or adding entire modules
with those taken from other sources. A genetically engineered PKS complex should of
course have the ability to catalyze the synthesis of the product predicted from the genetic
alterations made.
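
The conserved domain order lends itself to a simple design check on a proposed
engineered module. The following Python sketch is illustrative only; the function and
constant names are hypothetical, and the check covers only the ordering rule and the
minimal KS/AT/ACP requirement stated above, not whether a construct would fold or be
active.

```python
# Illustrative sketch only: names are assumptions introduced for this example.
CANONICAL_ORDER = ("KS", "AT", "DH", "ER", "KR", "ACP")  # N- to C-terminus
MINIMAL_DOMAINS = {"KS", "AT", "ACP"}


def is_valid_extender_module(domains) -> bool:
    """Return True if the domains form a minimal module in the conserved order."""
    seq = list(domains)
    if len(set(seq)) != len(seq):
        return False                      # no domain may appear twice
    if any(d not in CANONICAL_ORDER for d in seq):
        return False                      # unknown domain name
    if not MINIMAL_DOMAINS.issubset(seq):
        return False                      # KS, AT, and ACP are always required
    positions = [CANONICAL_ORDER.index(d) for d in seq]
    return positions == sorted(positions)  # present domains keep canonical order


print(is_valid_extender_module(["KS", "AT", "KR", "ACP"]))        # True
print(is_valid_extender_module(["KS", "AT", "KR", "DH", "ACP"]))  # False
```

Such a check could be run on each module of a planned construct before any cloning work
begins.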

Alignments of the many available amino acid sequences for Type I PKS enzymes
have approximately defined the boundaries of the various catalytic domains. Sequence
alignments also have revealed linker regions between the catalytic domains and at the N-
and C-termini of individual polypeptides. The sequences of these linker regions are less
well conserved than are those for the catalytic domains, which is in part how linker regions
are identified. Linker regions can be important for proper association between domains and
between the individual polypeptides that comprise the PKS complex. One can thus view
linkers and domains together as creating a scaffold on which the domains and modules are 
positioned in the correct orientation to be active. This organization and positioning, if 
retained, permits PKS domains of different or identical substrate specificities to be 
substituted (usually at the DNA level) between PKS enzymes by various available 
methodologies. In selecting the boundaries of, for example, an AT replacement, one can 
thus make the replacement so as to retain the linkers of the recipient PKS or to replace them 
with the linkers of the donor PKS AT domain, or, preferably, make both constructs to 
ensure that the correct linker regions between the KS and AT domains have been included 
in at least one of the engineered enzymes. Thus, there is considerable flexibility in the 
design of new PKS enzymes with the result that known polyketides can be produced more 
effectively, and novel polyketides useful as pharmaceuticals or for other purposes can be 
made. 
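
The practice of making both AT replacement constructs, one retaining the recipient
linkers and one carrying the donor linkers, can be pictured with a small data model. The
Python sketch below is illustrative only. The `Segment` type and both functions are
hypothetical, a module is reduced to an alternating list of domains and linkers, and the
donor-linker construct is assumed to swap both linkers flanking the AT domain.

```python
# Illustrative sketch only: the data model and names are assumptions.
from typing import List, NamedTuple, Tuple


class Segment(NamedTuple):
    kind: str    # "domain" or "linker"
    name: str    # e.g. "KS", "AT", or a linker such as "KS-AT"
    origin: str  # the PKS from which the segment is taken


def module_segments(origin: str, domains: List[str]) -> List[Segment]:
    """Build an alternating domain/linker list for a module from one PKS."""
    segments: List[Segment] = []
    for i, domain in enumerate(domains):
        if i:
            segments.append(Segment("linker", f"{domains[i - 1]}-{domain}", origin))
        segments.append(Segment("domain", domain, origin))
    return segments


def at_replacement_constructs(
    recipient: List[Segment], donor: str
) -> Tuple[List[Segment], List[Segment]]:
    """Return (recipient-linker construct, donor-linker construct)."""
    def is_at(s: Segment) -> bool:
        return s.kind == "domain" and s.name == "AT"

    def flanks_at(s: Segment) -> bool:
        return s.kind == "linker" and "AT" in s.name.split("-")

    keep = [s._replace(origin=donor) if is_at(s) else s for s in recipient]
    swap = [s._replace(origin=donor) if is_at(s) or flanks_at(s) else s
            for s in recipient]
    return keep, swap


recipient = module_segments("FK-520", ["KS", "AT", "ACP"])
keep, swap = at_replacement_constructs(recipient, "rapamycin")
print([s.origin for s in keep])
print([s.origin for s in swap])
```

Building and testing both constructs, as the text recommends, guards against the case
where only one linker arrangement yields an active enzyme.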

By appropriate application of recombinant DNA technology, a wide variety of 
polyketides can be prepared in a variety of different host cells provided one has access to 
nucleic acid compounds that encode PKS proteins and polyketide modification enzymes. 
The present invention helps meet the need for such nucleic acid compounds by providing 
recombinant vectors that encode the FK-520 PKS enzyme and various FK-520 modification 
enzymes. Moreover, while the FK-506 and FK-520 polyketides have many useful activities, 
there remains a need for compounds with similar useful activities but with better 
pharmacokinetic profile and metabolism and fewer side-effects. The present invention helps 
meet the need for such compounds as well. 

Summary of the Invention

In one embodiment, the present invention provides recombinant DNA vectors that
encode all or part of the FK-520 PKS enzyme. Illustrative vectors of the invention include
cosmids pKOS034-120, pKOS034-124, pKOS065-C31, pKOS065-C2, pKOS065-M27, and
pKOS065-M21. The invention also provides nucleic acid compounds that encode the
various domains of the FK-520 PKS, i.e., the KS, AT, ACP, KR, DH, and ER domains.
These compounds can be readily used, alone or in combination with nucleic acids encoding
other FK-520 or non-FK-520 PKS domains, as intermediates in the construction of
recombinant vectors that encode all or part of PKS enzymes that make novel polyketides.
The invention also provides isolated nucleic acids that encode all or part of one or
more modules of the FK-520 PKS, each module comprising a ketosynthase activity, an acyl
transferase activity, and an acyl carrier protein activity. The invention provides an isolated
nucleic acid that encodes one or more open reading frames of FK-520 PKS genes, said open
reading frames comprising coding sequences for a CoA ligase activity, an NRPS activity, or
two or more extender modules. The invention also provides recombinant expression vectors
containing these nucleic acids.

In another embodiment, the invention provides isolated nucleic acids that encode all
or a part of a PKS that contains at least one module in which at least one of the domains in
the module is a domain from a non-FK-520 PKS and at least one domain is from the
FK-520 PKS. The non-FK-520 PKS domain or module originates from the rapamycin PKS, the
FK-506 PKS, DEBS, or another PKS. The invention also provides recombinant expression
vectors containing these nucleic acids.

In another embodiment, the invention provides a method of preparing a polyketide,
said method comprising transforming a host cell with a recombinant DNA vector that
encodes at least one module of a PKS, said module comprising at least one FK-520 PKS
domain, and culturing said host cell under conditions such that said PKS is produced and
catalyzes synthesis of said polyketide. In one aspect, the method is practiced with a
Streptomyces host cell. In another aspect, the polyketide produced is FK-520. In another
aspect, the polyketide produced is a polyketide related in structure to FK-520. In another
aspect, the polyketide produced is a polyketide related in structure to FK-506 or rapamycin.

In another embodiment, the invention provides a set of genes in recombinant form
sufficient for the synthesis of ethylmalonyl CoA in a heterologous host cell. These genes
and the methods of the invention enable one to create recombinant host cells with the ability
to produce polyketides or other compounds that require ethylmalonyl CoA for biosynthesis.
The invention also provides recombinant nucleic acids that encode AT domains specific for
ethylmalonyl CoA. Thus, the compounds of the invention can be used to produce
polyketides requiring ethylmalonyl CoA in host cells that otherwise are unable to produce
such polyketides.

In another embodiment, the invention provides a set of genes in recombinant form
sufficient for the synthesis of 2-hydroxymalonyl CoA and 2-methoxymalonyl CoA in a
heterologous host cell. These genes and the methods of the invention enable one to create
recombinant host cells with the ability to produce polyketides or other compounds that
require 2-hydroxymalonyl CoA for biosynthesis. The invention also provides recombinant
nucleic acids that encode AT domains specific for 2-hydroxymalonyl CoA and
2-methoxymalonyl CoA. Thus, the compounds of the invention can be used to produce
polyketides requiring 2-hydroxymalonyl CoA or 2-methoxymalonyl CoA in host cells that
are otherwise unable to produce such polyketides.

In another embodiment, the invention provides a compound related in structure to
FK-520 or FK-506 that is useful in the treatment of a medical condition. These compounds
include compounds in which the C-13 methoxy group is replaced by a moiety selected from
the group consisting of hydrogen, methyl, and ethyl moieties. Such compounds are less
susceptible to the main in vivo pathway of degradation for FK-520 and FK-506 and related
compounds and thus exhibit an improved pharmacokinetic profile. The compounds of the
invention also include compounds in which the C-15 methoxy group is replaced by a moiety
selected from the group consisting of hydrogen, methyl, and ethyl moieties. The compounds
of the invention also include the above compounds further modified by chemical
methodology to produce derivatives such as, but not limited to, the C-18 hydroxyl
derivatives, which have potent neurotrophic but not immunosuppressive activities.

Thus, the invention provides polyketides having the structure:

[Chemical structure not reproduced]

wherein R1 is hydrogen, methyl, ethyl, or allyl; R2 is hydrogen or hydroxyl, provided that
when R2 is hydrogen, there is a double bond between C-20 and C-19; R3 is hydrogen or
hydroxyl; R4 is methoxyl, hydrogen, methyl, or ethyl; and R5 is methoxyl, hydrogen,
methyl, or ethyl; but not including FK-506, FK-520, 18-hydroxy-FK-520, and
18-hydroxy-FK-506. The invention provides these compounds in purified form and in
pharmaceutical compositions.

In another embodiment, the invention provides a method for treating a medical
condition by administering a pharmaceutically efficacious dose of a compound of the
invention. The compounds of the invention may be administered to achieve
immunosuppression or to stimulate nerve growth and regeneration.

These and other embodiments and aspects of the invention will be more fully
understood after consideration of the attached drawings and their brief description below,
together with the detailed description, examples, and claims that follow.


Brief Description of the Drawings 
Figure 1 shows a diagram of the FK-520 biosynthetic gene cluster. The top line
provides a scale in kilobase pairs (kb). The second line shows a restriction map with
selected restriction enzyme recognition sequences indicated. K is KpnI; X is XhoI; S is SacII;
P is PstI; and E is EcoRI. The third line indicates the position of FK-520 PKS and related
genes. Genes are abbreviated with a one letter designation, i.e., C is fkbC. Immediately
under the third line are numbered segments showing where the loading module (L) and ten
different extender modules (numbered 1-10) are encoded on the various genes shown. At
the bottom of the Figure, the DNA inserts of various cosmids of the invention (i.e., 34-124
is cosmid pKOS034-124) are shown in alignment with the FK-520 biosynthetic gene
cluster.

Figure 2 shows the loading module (load), the ten extender modules, and the peptide
synthetase domain of the FK-520 PKS, together with, on the top line, the genes that encode
the various domains and modules. Also shown are the various intermediates in FK-520
biosynthesis, as well as the structure of FK-520, with carbons 13, 15, 21, and 31 numbered.
The various domains of each module and subdomains of the loading module are also shown.
The darkened circles showing the DH domains in modules 2, 3, and 4 indicate that the
dehydratase domain is not functional as a dehydratase; this domain may affect the
stereochemistry at the corresponding position in the polyketide. The substituents on the
FK-520 structure that result from the action of non-PKS enzymes are also indicated by arrows,
together with the types of enzymes or the genes that code for the enzymes that mediate the
action. Although the methyltransferase is shown acting at the C-13 and C-15 hydroxyl
groups after release of the polyketide from the PKS, the methyltransferase may act on the
2-hydroxymalonyl substrate prior to or contemporaneously with its incorporation during
polyketide synthesis.

Figure 3 shows a close-up view of the left end of the FK-520 gene cluster, which
contains at least ten additional genes. The ethyl side chain on carbon 21 of FK-520 (Figure
2) is derived from an ethylmalonyl CoA extender unit that is incorporated by an
ethylmalonyl specific AT domain in extender module 4 of the PKS. At least four of the
genes in this region code for enzymes involved in ethylmalonyl biosynthesis. The
polyhydroxybutyrate depolymerase is involved in maintaining hydroxybutyryl-CoA pools
during FK-520 production. Polyhydroxybutyrate accumulates during vegetative growth and
disappears during stationary phase in other Streptomyces (Ranade and Vining, 1993, Can. J.
Microbiol. 39:311). Open reading frames with unknown function are indicated with a
question mark.

Figure 4 shows a biosynthetic pathway for the biosynthesis of ethylmalonyl CoA
from acetoacetyl CoA consistent with the function assigned to four of the genes in the
FK-520 gene cluster shown in Figure 3.

Figure 5 shows a close-up view of the right-end of the FK-520 PKS gene cluster
(and of the sequences on cosmid pKOS065-C31). The genes shown include fkbD, fkbM (a
methyl transferase that methylates the hydroxyl group on C-31 of FK-520), fkbN (a
homolog of a gene described as a regulator of cholesterol oxidase and that is believed to be
a transcriptional activator), fkbQ (a type II thioesterase, which can increase polyketide
production levels), and fkbS (a crotonyl-CoA reductase involved in the biosynthesis of
ethylmalonyl CoA).

Figure 6 shows the proposed degradative pathway for tacrolimus (FK-506)
metabolism.

Figure 7 shows a schematic process for the construction of recombinant PKS genes
of the invention that encode PKS enzymes that produce 13-desmethoxy FK-506 and
FK-520 polyketides of the invention, as described in Example 4, below.

Figure 8, in Parts A and B, shows certain compounds of the invention preferred for
dermal application in Part A and a synthetic route for making those compounds in Part B.



Detailed Description of the Invention 



Given the valuable pharmaceutical properties of polyketides, there is a need for
methods and reagents for producing large quantities of polyketides, as well as for producing
related compounds not found in nature. The present invention provides such methods and
reagents, with particular application to methods and reagents for producing the polyketides
known as FK-520, also known as ascomycin or L-683,590 (see Holt et al., 1993, JACS
115:9925), and FK-506, also known as tacrolimus. Tacrolimus is a macrolide
immunosuppressant used to prevent or treat rejection of transplanted heart, kidney, liver,
lung, pancreas, and small bowel allografts. The drug is also useful for the prevention and
treatment of graft-versus-host disease in patients receiving bone marrow transplants, and for
the treatment of severe, refractory uveitis. There have been additional reports of the
unapproved use of tacrolimus for other conditions, including alopecia universalis,
autoimmune chronic active hepatitis, inflammatory bowel disease, multiple sclerosis,
primary biliary cirrhosis, and scleroderma. The invention provides methods and reagents for
making novel polyketides related in structure to FK-520 and FK-506, and structurally
related polyketides such as rapamycin.

The FK-506 and rapamycin polyketides are potent immunosuppressants, with 
chemical structures shown below. 




[Chemical structures of FK-506 and rapamycin]

FK-520 differs from FK-506 in that it lacks the allyl group at C-21 of FK-506, having
instead an ethyl group at that position, and has similar activity to FK-506, albeit reduced
immunosuppressive activity.



These compounds act through initial formation of an intermediate complex with
protein "immunophilins" known as FKBPs (FK-506 binding proteins), including FKBP-12.
Immunophilins are a class of cytosolic proteins that form complexes with molecules such as
FK-506, FK-520, and rapamycin that in turn serve as ligands for other cellular targets
involved in signal transduction. Binding of FK-506, FK-520, and rapamycin to FKBP
occurs through the structurally similar segments of the polyketide molecules, known as the
"FKBP-binding domain" (as generally but not precisely indicated by the stippled regions in
the structures above). The FK-506-FKBP complex then binds calcineurin, while the
rapamycin-FKBP complex binds to a protein known as RAFT-1. Binding of the
FKBP-polyketide complex to these second proteins occurs through the dissimilar regions of
the drugs known as the "effector" domains.

The three component FKBP-polyketide-effector complex is required for signal
transduction and subsequent immunosuppressive activity of FK-506, FK-520, and
rapamycin. Modifications in the effector domains of FK-506, FK-520, and rapamycin that
destroy binding to the effector proteins (calcineurin or RAFT) lead to loss of
immunosuppressive activity, even though FKBP binding is unaffected. Further, such
analogs antagonize the immunosuppressive effects of the parent polyketides, because they
compete for FKBP. Such non-immunosuppressive analogs also show reduced toxicity (see
Dumont et al., 1992, Journal of Experimental Medicine 176, 751-760), indicating that much
of the toxicity of these drugs is not linked to FKBP binding.

In addition to immunosuppressive activity, FK-520, FK-506, and rapamycin have
neurotrophic activity. In the central nervous system and in peripheral nerves, immunophilins
are referred to as "neuroimmunophilins". The neuroimmunophilin FKBP is markedly
enriched in the central nervous system and in peripheral nerves. Molecules that bind to the
neuroimmunophilin FKBP, such as FK-506 and FK-520, have the remarkable effect of
stimulating nerve growth. In vitro, they act as neurotrophins, i.e., they promote neurite
outgrowth in NGF-treated PC12 cells and in sensory neuronal cultures, and in intact
animals, they promote regrowth of damaged facial and sciatic nerves, and repair lesioned
serotonin and dopamine neurons in the brain. See Gold et al., Jun. 1999, J. Pharm. Exp.
Ther. 289(3): 1202-1210; Lyons et al., 1994, Proc. National Academy of Science 91:
3191-3195; Gold et al., 1995, Journal of Neuroscience 15: 7509-7516; and Steiner et al., 1997,
Proc. National Academy of Science 94: 2019-2024. Further, the restored central and
peripheral neurons appear to be functional.

Compared to protein neurotrophic molecules (BDNF, NGF, etc.), the
small-molecule neurotrophins such as FK-506, FK-520, and rapamycin have different, and often
advantageous, properties. First, whereas protein neurotrophins are difficult to deliver to
their intended site of action and may require intra-cranial injection, the small-molecule
neurotrophins display excellent bioavailability; they are active when administered
subcutaneously and orally. Second, whereas protein neurotrophins show quite specific
effects, the small-molecule neurotrophins show rather broad effects. Finally, whereas
protein neurotrophins often show effects on normal sensory nerves, the small-molecule
neurotrophins do not induce aberrant sprouting of normal neuronal processes and seem to
affect damaged nerves specifically. Neuroimmunophilin ligands have potential therapeutic
utility in a variety of disorders involving nerve degeneration (e.g., multiple sclerosis,
Parkinson's disease, Alzheimer's disease, stroke, traumatic spinal cord and brain injury,
peripheral neuropathies).

Recent studies have shown that the immunosuppressive and neurite outgrowth
activity of FK-506, FK-520, and rapamycin can be separated; the neuroregenerative activity
in the absence of immunosuppressive activity is retained by agents which bind to FKBP but
not to the effector proteins calcineurin or RAFT. See Steiner et al., 1997, Nature Medicine
3: 421-428.



Available structure-activity data show that the important features for neurotrophic
activity of rapamycin, FK-520, and FK-506 lie within the common, contiguous segments of
the macrolide ring that bind to FKBP. This portion of the molecule is termed the "FKBP
binding domain" (see VanDuyne et al., 1993, Journal of Molecular Biology 229: 105-124).
Nevertheless, the effector domains of the parent macrolides contribute to conformational
rigidity of the binding domain and thus indirectly contribute to FKBP binding.



There are a number of other reported analogs of FK-506, FK-520, and rapamycin that bind
to FKBP but not the effector protein calcineurin or RAFT. These analogs show effects on
nerve regeneration without immunosuppressive effects.

Naturally occurring FK-520 and FK-506 analogs include the antascomycins, which
are FK-506-like macrolides that lack the functional groups of FK-506 that bind to
calcineurin (see Fehr et al., 1996, The Journal of Antibiotics 49: 230-233). These molecules
bind FKBP as effectively as does FK-506; they antagonize the effects of both FK-506 and
rapamycin, yet lack immunosuppressive activity.




[Chemical structure of Antascomycin A, with the "FKBP binding domain" indicated]

Other analogs can be produced by chemically modifying FK-506, FK-520, or
rapamycin. One approach to obtaining neuroimmunophilin ligands is to destroy the effector
binding region of FK-506, FK-520, or rapamycin by chemical modification. While the
chemical modifications permitted on the parent compounds are quite limited, some useful
chemically modified analogs exist. The FK-520 analog L-685,818 (ED50 = 0.7 nM for
FKBP binding; see Dumont et al., 1992), and the rapamycin analog WAY-124,466 (IC50 =
12.5 nM; see Ocain et al., 1993, Biochemistry Biophysical Research Communications 192:
1340-1346) are about as effective as FK-506, FK-520, and rapamycin at promoting
neurite outgrowth in sensory neurons (see Steiner et al., 1997).




[Chemical structures of L-685,818 and WAY-124,466]

One of the few positions of rapamycin that is readily amenable to chemical
modification is the allylic 16-methoxy group; this reactive group is readily exchanged by
acid-catalyzed nucleophilic substitution. Replacement of the 16-methoxy group of
rapamycin with a variety of bulky groups has produced analogs showing selective loss of
immunosuppressive activity while retaining FKBP-binding (see Luengo et al., 1995,
Chemistry & Biology 2: 471-481). One of the best compounds, 1, below, shows complete
loss of activity in the splenocyte proliferation assay with only a 10-fold reduction in binding
to FKBP.




There are also synthetic analogs of FKBP binding domains. These compounds
reflect an approach to obtaining neuroimmunophilin ligands based on "rationally designed"
molecules that retain the FKBP-binding region in an appropriate conformation for binding
to FKBP, but do not possess the effector binding regions. In one example, the ends of the
FKBP binding domain were tethered by hydrocarbon chains (see Holt et al., 1993, Journal
of the American Chemical Society 115: 9925-9938); the best analog, 2, below, binds to
FKBP about as well as FK-506. In a similar approach, the ends of the FKBP binding
domain were tethered by a tripeptide to give analog 3, below, which binds to FKBP about
20-fold more poorly than FK-506. These compounds are anticipated to have
neuroimmunophilin binding activity.







In a primate MPTP model of Parkinson's disease, administration of FKBP ligand
GPI-1046 caused brain cells to regenerate and behavioral measures to improve. MPTP is a
neurotoxin, which, when administered to animals, selectively damages nigral-striatal
dopamine neurons in the brain, mimicking the damage caused by Parkinson's disease.
Whereas, before treatment, animals were unable to use affected limbs, the FKBP ligand
restored the ability of animals to feed themselves and gave improvements in measures of
locomotor activity, neurological outcome, and fine motor control. There were also
corresponding increases in regrowth of damaged nerve terminals. These results demonstrate
the utility of FKBP ligands for treatment of diseases of the CNS.
From the above description, two general approaches towards the design of
non-immunosuppressant, neuroimmunophilin ligands can be seen. The first involves the
construction of constrained cyclic analogs of FK-506 in which the FKBP binding domain is
fixed in a conformation optimal for binding to FKBP. The advantages of this approach are
that the conformation of the analogs can be accurately modeled and predicted by
computational methods, and the analogs closely resemble parent molecules that have proven
pharmacological properties. A disadvantage is that the difficult chemistry limits the
numbers and types of compounds that can be prepared. The second approach involves the
trial and error construction of acyclic analogs of the FKBP binding domain by conventional
medicinal chemistry. The advantages to this approach are that the chemistry is suitable for
production of the numerous compounds needed for such interactive chemistry-bioassay
approaches. The disadvantages are that the molecular types of compounds that have
emerged have no known history of appropriate pharmacological properties, have rather
labile ester functional groups, and are too conformationally mobile to allow accurate
prediction of conformational properties.

The present invention provides useful methods and reagents related to the first
approach, but with significant advantages. The invention provides recombinant PKS genes
that produce a wide variety of polyketides that cannot otherwise be readily synthesized by
chemical methodology alone. Moreover, the present invention provides polyketides that
have either or both of the desired immunosuppressive and neurotrophic activities, some of
which are produced only by fermentation and others of which are produced by fermentation
and chemical modification. Thus, in one aspect, the invention provides compounds that
optimally bind to FKBP but do not bind to the effector proteins. The methods and reagents
of the invention can be used to prepare numerous constrained cyclic analogs of FK-520 in
which the FKBP binding domain is fixed in a conformation optimal for binding to FKBP.
Such compounds will show neuroimmunophilin binding (neurotrophic) but not
immunosuppressive effects. The invention also allows direct manipulation of FK-520 and
related chemical structures via genetic engineering of the enzymes involved in the
biosynthesis of FK-520 (as well as related compounds, such as FK-506 and rapamycin);
similar chemical modifications are simply not possible because of the complexity of the
structures. The invention can also be used to introduce "chemical handles" into normally
inert positions that permit subsequent chemical modifications.

Several general approaches to achieve the development of novel neuroimmunophilin
ligands are facilitated by the methods and reagents of the present invention. One approach is
to make "point mutations" of the functional groups of the parent FK-520 structure that bind
to the effector molecules to eliminate their binding potential. These types of structural
modifications are difficult to perform by chemical modification, but can be readily
accomplished with the methods and reagents of the invention.

A second, more extensive approach facilitated by the present invention is to utilize
molecular modeling to predict optimal structures ab initio that bind to FKBP but not
effector molecules. Using the available X-ray crystal structure of FK-520 (or FK-506)
bound to FKBP, molecular modeling can be used to predict polyketides that should
optimally bind to FKBP but not calcineurin. Various macrolide structures can be generated
by linking the ends of the FKBP-binding domain with "all possible" polyketide chains of
variable length and substitution patterns that can be prepared by genetic manipulation of the
FK-520 or FK-506 PKS gene cluster in accordance with the methods of the invention. The
ground state conformations of the virtual library can be determined, and compounds that
possess binding domains most likely to bind well to FKBP can be prepared and tested.

Once a compound is identified in accordance with the above approaches, the 
invention can be used to generate a focused library of analogs around the lead candidate, to 
"fine tune" the compound for optimal properties. Finally, the genetic engineering methods 
of the invention can be directed towards producing "chemical handles" that enable
medicinal chemists to modify positions of the molecule previously inert to chemical 
modification. This opens the path to previously prohibited chemical optimization of lead 
compounds by time-proven approaches. 

Moreover, the present invention provides polyketide compounds, and the
recombinant genes for the PKS enzymes that produce them, that have significant
advantages over FK-506 and FK-520 and their analogs. The metabolism and
pharmacokinetics of tacrolimus have been extensively studied, and FK-520 is believed to be
similar in these respects. Absorption of tacrolimus from the gastrointestinal tract is rapid,
variable, and incomplete (Harrison's Principles of Internal Medicine, 14th edition, 1998,
McGraw Hill, 14, 20, 21, 64-67). The mean bioavailability of the oral dosage form is 27%
(range 5 to 65%). The volume of distribution (VolD) based on plasma is 5 to 65 L per kg of
body weight (L/kg), and is much higher than the VolD based on whole blood
concentrations, the difference reflecting the binding of tacrolimus to red blood cells. Whole
blood concentrations may be 12 to 67 times the plasma concentrations. Protein binding is
high (75 to 99%), primarily to albumin and alpha 1-acid glycoprotein. The half-life for
distribution is 0.9 hour; elimination is biphasic and variable: terminal, 11.3 hours (range, 3.5 to
40.5 hours). The time to peak concentration is 0.5 to 4 hours after oral administration.
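
As a rough illustration of what the terminal half-life quoted above implies, the fraction of drug remaining after a given time can be sketched with a single-exponential model. This is a simplifying assumption: the text describes elimination as biphasic and variable, and the function name and the 24-hour time point are illustrative only.

```python
import math

def fraction_remaining(t_hours: float, half_life_hours: float) -> float:
    """Fraction of drug left after t_hours, assuming first-order (mono-exponential) decay."""
    if half_life_hours <= 0:
        raise ValueError("half-life must be positive")
    return math.exp(-math.log(2) * t_hours / half_life_hours)

# Terminal half-life values quoted in the text: mean 11.3 h, range 3.5 to 40.5 h.
for t_half in (3.5, 11.3, 40.5):
    print(f"t1/2 = {t_half:>4} h -> {fraction_remaining(24, t_half):.1%} left after 24 h")
```

The spread between the shortest and longest half-life gives a feel for why dosing is described as variable; the model says nothing about the distribution phase.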

Tacrolimus is metabolized primarily by cytochrome P450 3A enzymes in the liver
and small intestine. The drug is extensively metabolized with less than 1% excreted
unchanged in urine. Because hepatic dysfunction decreases clearance of tacrolimus, doses
have to be reduced substantially in primary graft non-function, especially in children. In
addition, drugs that induce the cytochrome P450 3A enzymes reduce tacrolimus levels,
while drugs that inhibit these P450s increase tacrolimus levels. Tacrolimus bioavailability
doubles with co-administration of ketoconazole, a drug that inhibits P450 3A. See Vincent
et al., 1992, In vitro metabolism of FK-506 in rat, rabbit, and human liver microsomes:
Identification of a major metabolite and of cytochrome P450 3A as the major enzymes
responsible for its metabolism, Arch. Biochem. Biophys. 294: 454-460; Iwasaki et al., 1993,
Isolation, identification, and biological activities of oxidative metabolites of FK-506, a
potent immunosuppressive macrolide lactone, Drug Metabolism & Disposition 21: 971-977;
Shiraga et al., 1994, Metabolism of FK-506, a potent immunosuppressive agent, by
cytochrome P450 3A enzymes in rat, dog, and human liver microsomes, Biochem.
Pharmacol. 47: 727-735; and Iwasaki et al., 1995, Further metabolism of FK-506
(Tacrolimus): Identification and biological activities of the metabolites oxidized at multiple
sites of FK-506, Drug Metabolism & Disposition 23: 23-34. The cytochrome P450 3A
subfamily of isozymes has been implicated as important in this degradative process.

Structures of the eight isolated metabolites formed by liver microsomes are shown in
Figure 6. Four metabolites of FK-506 involve demethylation of the oxygens on carbons 13,
15, and 31, and hydroxylation of carbon 12. The 13-demethylated (hydroxy) compounds
undergo cyclizations of the 13-hydroxy at C-10 to give M-I, M-VI, and M-VII, and the
12-hydroxy metabolite at C-10 to give I. Another four metabolites formed by oxidation of the
four metabolites mentioned above were isolated by liver microsomes from dexamethasone
treated rats. Three of these are metabolites doubly demethylated at the methoxy groups on
carbons 15 and 31 (M-V), 13 and 31 (M-VI), and 13 and 15 (M-VII). The fourth, M-VIII,
was the metabolite produced after demethylation of the 31-methoxy group, followed by
formation of a fused ring system by further oxidation. Among the eight metabolites, M-II
has immunosuppressive activity comparable to that of FK-506, whereas the other
metabolites exhibit weak or negligible activities. Importantly, the major metabolite of
human, dog, and rat liver microsomes is the 13-demethylated and cyclized FK-506 (M-I).

Thus, the major metabolism of FK-506 proceeds via 13-demethylation followed by
cyclization to the inactive M-I, this representing about 90% of the metabolic products after a
10 minute incubation with liver microsomes. Analogs of tacrolimus that do not possess a
C-13 methoxy group would not be susceptible to the first and most important
biotransformation in the destructive metabolism of tacrolimus (i.e., cyclization of
13-hydroxy to C-10). Thus, a 13-desmethoxy analog of FK-506 should have a longer half-life
in the body than does FK-506. The C-13 methoxy group is believed not to be required for
binding to FKBP or calcineurin. The C-13 methoxy is not present on the identical position
of rapamycin, which binds to FKBP with an affinity equal to that of tacrolimus. Also, analysis of
the 3-dimensional structure of the FKBP-tacrolimus-calcineurin complex shows that the
C-13 methoxy has no interaction with FKBP and only a minor interaction with calcineurin.
The present invention provides C-13-desmethoxy analogs of FK-506 and FK-520, as well
as the recombinant genes that encode the PKS enzymes that catalyze their synthesis and
host cells that produce the compounds.

These compounds exhibit, relative to their naturally occurring counterparts,
prolonged immunosuppressive action in vivo, thereby allowing a lower dosage and/or
reduced frequency of administration. Dosing is more predictable, because the variability in
FK-506 dosage is largely due to variation of metabolism rate. FK-506 levels in blood can
vary widely depending on interactions with drugs that induce or inhibit cytochrome P450
3A (summarized in USP Drug Information for the Health Care Professional). Of particular
importance are the numerous drugs that inhibit or compete for CYP 3A, because they
increase FK-506 blood levels and lead to toxicity (Prograf package insert, Fujisawa US,
Rev 4/97, Rec 6/97). Also important are the drugs that induce P450 3A (e.g.,
dexamethasone), because they decrease FK-506 blood levels and reduce efficacy. Because
the major site of CYP 3A action on FK-506 is removed in the analogs provided by the
present invention, those analogs are not as susceptible to drug interactions as the naturally
occurring compounds.

Hyperglycemia, nephrotoxicity, and neurotoxicity are the most significant adverse
effects resulting from the use of FK-506 and are believed to be similar for FK-520. Because
these effects appear to occur primarily by the same mechanism as the immunosuppressive
action (i.e., FKBP-calcineurin interaction), the intrinsic toxicity of the desmethoxy analogs
may be similar to FK-506. However, toxicity of FK-506 is dose related and correlates with
high blood levels of the drug (Prograf package insert, Fujisawa US, Rev 4/97, Rec 6/97).
Because the levels of the compounds provided by the present invention should be more
controllable, the incidence of toxicity should be significantly decreased with the
13-desmethoxy analogs. Some reports show that certain FK-506 metabolites are more toxic
than FK-506 itself, and this provides an additional reason to expect that a CYP 3A resistant
analog can have lower toxicity and a higher therapeutic index.

Thus, the present invention provides novel compounds related in structure to FK-506
and FK-520 but with improved properties. The invention also provides methods for
making these compounds by fermentation of recombinant host cells, as well as the
recombinant host cells, the recombinant vectors in those host cells, and the recombinant
proteins encoded by those vectors. The present invention also provides other valuable
materials useful in the construction of these recombinant vectors that have many other
important applications as well. In particular, the present invention provides the FK-520 PKS
genes, as well as certain genes involved in the biosynthesis of FK-520, in recombinant form.

FK-520 is produced at relatively low levels in the naturally occurring cells,
Streptomyces hygroscopicus var. ascomyceticus, in which it was first identified. Thus,
another benefit provided by the recombinant FK-520 PKS and related genes of the present
invention is the ability to produce FK-520 in greater quantities in the recombinant host cells
provided by the invention. The invention also provides methods for making novel FK-520
analogs, in addition to the desmethoxy analogs described above, and derivatives in
recombinant host cells of any origin.

The biosynthesis of FK-520 involves the action of several enzymes. The FK-520
PKS enzyme, which is composed of the fkbA, fkbB, fkbC, and fkbP gene products,
synthesizes the core structure of the molecule. There is also a hydroxylation at C-9 mediated
by the P450 hydroxylase that is the fkbD gene product and that is oxidized by the fkbO gene
product to result in the formation of a keto group at C-9. There is also a methylation at C-31
that is mediated by an O-methyltransferase that is the fkbM gene product. There are also
methylations at the C-13 and C-15 positions by a methyltransferase believed to be encoded
in the fkbG gene; this methyltransferase may act on the hydroxymalonyl CoA substrates
prior to binding of the substrate to the AT domains of the PKS during polyketide synthesis.
The present invention provides the genes encoding these enzymes in recombinant form. The
invention also provides the genes encoding the enzymes involved in ethylmalonyl CoA and
2-hydroxymalonyl CoA biosynthesis in recombinant form. Moreover, the invention
provides Streptomyces hygroscopicus var. ascomyceticus recombinant host cells lacking
one or more of these genes that are useful in the production of useful compounds.

The cells are useful in production in a variety of ways. First, certain cells make a
useful FK-520-related compound merely as a result of inactivation of one or more of the
FK-520 biosynthesis genes. Thus, by inactivating the C-31 O-methyltransferase gene in
Streptomyces hygroscopicus var. ascomyceticus, one creates a host cell that makes a
desmethyl (at C-31) derivative of FK-520. Second, other cells of the invention are unable to
make FK-520 or FK-520 related compounds due to an inactivation of one or more of the
PKS genes. These cells are useful in the production of other polyketides produced by PKS
enzymes that are encoded on recombinant expression vectors and introduced into the host
cell.

Moreover, if only one PKS gene is inactivated, the ability to produce FK-520 or an
FK-520 derivative compound is restored by introduction of a recombinant expression vector
that contains the functional gene in a modified or unmodified form. The introduced gene
produces a gene product that, together with the other endogenous and functional gene
products, produces the desired compound. This methodology enables one to produce
FK-520 derivative compounds without requiring that all of the genes for the PKS enzyme be
present on one or more expression vectors. Additional applications and benefits of such
cells and methodology will be readily apparent to those of skill in the art after consideration
of how the recombinant genes were isolated and employed in the construction of the
compounds of the invention.

The FK-520 biosynthetic genes were isolated by the following procedure. Genomic
DNA was isolated from Streptomyces hygroscopicus var. ascomyceticus (ATCC 14891)
using the lysozyme/proteinase K protocol described in Genetic Manipulation of
Streptomyces - A Laboratory Manual (Hopwood et al., 1986). The average size of the DNA
was estimated to be between 80 and 120 kb by electrophoresis on 0.3% agarose gels. A library
was constructed in the SuperCos™ vector according to the manufacturer's instructions and
with the reagents provided in the commercially available kit (Stratagene). Briefly, 100 µg of
genomic DNA was partially digested with 4 units of Sau3AI for 20 min. in a reaction
volume of 1 mL, and the fragments were dephosphorylated and ligated to SuperCos vector
arms. The ligated DNA was packaged and used to infect log-stage XL1-Blue MR cells. A
library of about Mj.OOu independent cosmid clones was obtained.
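
For orientation, the number of independent clones a genomic library needs in order to contain any given locus with a chosen probability is commonly estimated with the Clarke-Carbon formula, N = ln(1 - P) / ln(1 - f), where f is the insert size divided by the genome size. The sketch below is illustrative only: the genome size (about 8 Mb) and insert size (about 40 kb) are assumed round figures and are not stated in the text.

```python
import math

def clones_needed(genome_bp: float, insert_bp: float, probability: float) -> int:
    """Clarke-Carbon estimate: N = ln(1 - P) / ln(1 - f), with f = insert/genome."""
    if not 0 < probability < 1:
        raise ValueError("probability must be strictly between 0 and 1")
    f = insert_bp / genome_bp
    if not 0 < f < 1:
        raise ValueError("insert must be smaller than the genome")
    return math.ceil(math.log(1 - probability) / math.log(1 - f))

# Assumed values, not taken from the text: ~8 Mb genome, ~40 kb cosmid inserts.
n = clones_needed(genome_bp=8_000_000, insert_bp=40_000, probability=0.99)
print(n)
```

A library several-fold larger than this estimate gives redundant coverage, which helps when overlapping cosmids are needed to walk along a large gene cluster.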

Based on recently published sequence from the FK-506 cluster (Motamedi and
Shafiee, 1998, Eur. J. Biochem. 256: 528), a probe for the fkbO gene was isolated from
ATCC 14891 using PCR with degenerate primers. With this probe, a cosmid designated
pKOS034-124 was isolated from the library. With probes made from the ends of cosmid
pKOS034-124, an additional cosmid designated pKOS034-120 was isolated. These cosmids
(pKOS034-124 and pKOS034-120) were shown to contain DNA inserts that overlap with
one another. Initial sequence data from these two cosmids generated sequences similar to
sequences from the FK-506 and rapamycin clusters, indicating that the inserts were from the
FK-520 PKS gene cluster. Two EcoRI fragments were subcloned from cosmids pKOS034-124
and pKOS034-120. These subclones were used to prepare shotgun libraries by partial
digestion with Sau3AI, gel purification of fragments between 1.5 kb and 3 kb in size, and
ligation into the pLitmus28 vector (New England Biolabs). These libraries were sequenced
using dye terminators on a Beckman CEQ2000 capillary electrophoresis sequencer,
according to the manufacturer's protocols.

To obtain cosmids containing sequence on the left and right sides of the sequenced
region described above, a new cosmid library of ATCC 14891 DNA was prepared
essentially as described above. This new library was screened with a new fkbM probe
isolated using DNA from ATCC 14891. A probe representing the fkbP gene at the end of
cosmid pKOS034-124 was also used. Several additional cosmids to the right of the
previously sequenced region were identified. Cosmids pKOS065-C31 and pKOS065-C3
were identified and then mapped with restriction enzymes. Initial sequences from these
cosmids were consistent with the expected organization of the cluster in this region. More
extensive sequencing showed that both cosmids contained, in addition to the desired
sequences, other sequences not contiguous to the desired sequences on the host cell
chromosomal DNA. Probing of additional cosmid libraries identified two additional
cosmids, pKOS065-M27 and pKOS065-M21, that contained the desired sequences in a
contiguous segment of chromosomal DNA. Cosmids pKOS034-124, pKOS034-120,
pKOS065-M27, and pKOS065-M21 have been deposited with the American Type Culture
Collection, Manassas, VA, USA. The complete nucleotide sequence of the coding
sequences of the genes that encode the proteins of the FK-520 PKS is shown below but
can also be determined from the cosmids of the invention deposited with the ATCC using
standard methodology.

Referring to Figures 1 and 3, the FK-520 PKS gene cluster is composed of four open
reading frames designated fkbB, fkbC, fkbA, and fkbP. The fkbB open reading frame encodes
the loading module and the first four extender modules of the PKS. The fkbC open reading
frame encodes extender modules five and six of the PKS. The fkbA open reading frame
encodes extender modules seven, eight, nine, and ten of the PKS. The fkbP open reading
frame encodes the NRPS of the PKS. Each of these genes can be isolated from the cosmids
of the invention described above. The DNA sequences of these genes are provided below,
preceded by the following table identifying the start and stop codons of the open reading
frames of each gene and the modules and domains contained therein.





Nucleotides                        Gene or Domain

complement (412 - 1836)            fkbW
complement (2020 - 3579)           fkbV
complement (3969 - 4496)           fkbR2
complement (4595 - 5488)           fkbR1
5601 - 6818                        fkbE
6808 - 8052                        fkbF
8156 - 8824                        fkbG
complement (9122 - 9883)           fkbH
complement (9894 - 10994)          fkbI
complement (10987 - 11247)         fkbJ
complement (11244 - 12092)         fkbK
complement (12113 - 13150)         fkbL
complement (13212 - 23988)         fkbC
complement (23992 - 46573)         fkbB
46754 - 47788                      fkbO
47785 - 52272                      fkbP
52275 - 71465                      fkbA
71462 - 72628                      fkbD
72625 - 73407                      fkbM
complement (73460 - 76202)         fkbN
complement (76336 - 77080)         fkbQ
complement (77076 - 77535)         fkbS
complement (44974 - 46573)         CoA ligase of loading domain
complement (43777 - 44629)         ER of loading domain
complement (43144 - 43660)         ACP of loading domain
complement (41842 - 43093)         KS of extender module 1 (KS1)
complement (40609 - 41842)         AT1
complement (39442 - 40609)         DH1
complement (38677 - 39307)         KR1
complement (38371 - 38581)         ACP1
complement (37145 - 38296)         KS2
complement (35749 - 37144)         AT2
complement (34606 - 35749)         DH2 (inactive)
complement (33823 - 34480)         KR2
complement (33505 - 33715)         ACP2
complement (32185 - 33439)         KS3
complement (31018 - 32185)         AT3
complement (29869 - 31018)         DH3 (inactive)
complement (29092 - 29740)         KR3
complement (28750 - 28960)         ACP3
complement (27430 - 28684)         KS4
complement (26146 - 27430)         AT4
complement (24997 - 26146)         DH4 (inactive)
complement (24163 - 24373)         ACP4
complement (22653 - 23892)         KS5
complement (21420 - 22653)         AT5
complement (20241 - 21420)         DH5
complement (194(4 - 20097)         KR5
complement (19116 - 19326)         ACP5
complement (17820 - 19053)         KS6
complement (16587 - 17820)         AT6
complement (15438 - 16587)         DH6
complement (14517 - 15294)         ER6
complement (13761 - 14394)         KR6
complement (13452 - 13662)         ACP6
52362 - 53576                      KS7
53577 - 54716                      AT7
54717 - 55871                      DH7
56019 - 56819                      ER7
56943 - 57575                      KR7
57710 - 57920                      ACP7
57990 - 59243                      KS8
59244 - 60398                      AT8
60399 - 61412                      DH8 (inactive)
61548 - 62180                      KR8
62328 - 62537                      ACP8
62598 - 63854                      KS9
63855 - 65084                      AT9
65085 - 66254                      DH9
66399 - 67H5                       ER9
67299 - 67931                      KR9
68094 - 68303                      ACP9
68397 - 69653                      KS10
69654 - 70985                      AT10
71064 - 71273                      ACP10
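
Coordinates such as those tabulated above lend themselves to simple consistency checks, for example confirming that a domain falls inside the open reading frame said to encode it. The following is a minimal sketch with hypothetical names (`Feature`, `contains`); it assumes 1-based inclusive coordinates and treats "complement" as a strand flag.

```python
from typing import NamedTuple

class Feature(NamedTuple):
    name: str
    start: int          # 1-based, inclusive
    end: int            # 1-based, inclusive
    complement: bool    # True if encoded on the reverse strand

    @property
    def length(self) -> int:
        return self.end - self.start + 1

def contains(outer: Feature, inner: Feature) -> bool:
    """True if inner lies within outer on the same strand."""
    return (outer.complement == inner.complement
            and outer.start <= inner.start and inner.end <= outer.end)

# A few rows copied from the table above.
fkbA = Feature("fkbA", 52275, 71465, False)
fkbB = Feature("fkbB", 23992, 46573, True)
ks7 = Feature("KS7", 52362, 53576, False)
acp4 = Feature("ACP4", 24163, 24373, True)

for gene, domain in ((fkbA, ks7), (fkbB, acp4), (fkbA, acp4)):
    print(f"{domain.name} in {gene.name}: {contains(gene, domain)}")
print(f"fkbA spans {fkbA.length} bp")
```

The same check, run over every row, would flag any domain whose coordinates were mistranscribed outside its gene.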
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60 62581 3GGGCGGTTA GCTGG2CGGG GCGGCGGAGT TCGACGCGGC GTTCTTCGAC A7 7AGTCCGC 
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G AGGGACGAG7 77 7GGATCGG GTGGGGGGAG 

74 101 TCCCGGAACG 7G77GTACGC GGGCCGG7AG CGCGCGGGGG 1GAGA7GG7G 77GCCCACGG 

74 161 G7G7AGAC7A 7G7GGAG7GC GAAGAGGC7G 7 7GGAG3717 7 "777 Z GGG^A CGGGTGGGGG 

74 221 AGG7A77777 G7GCCGGG7G CAGGTGGGGC AG7GGGA777 7777G77GAC GG7GGTG77C 

74 34 1 GGGGA77GGA CGGGGGCGGT GAGGTGGGGG GGGCGGAG 17 777777GGGC GGGGAA.C77G 

4 4 1 1 G 1G7 GGAGGG GG7GG7CGGG GGGGG7GGGG A7^ i - - - - - - ,J ; 

40 "4 581 7G777GGA7C AGG3GGGGAG CGCGTTGCTC AGGGGG7777 7 3 7CGACGG7 G3GG7G7GGG 

7 4 741 ACGG77GCGG .^AvAACGAGGC GACCTCGTCC TCGGCGGGCG GATGGGGCGG ACGCGGC^GA 

74" 01 TGGG3GGGGG GGGGA7AGAT CAGGGCGAGG GACAGG7G7 3 3GAGG3GCAG GTGCGCCCGG 

74 7 cl CGCTGGTCGG 7GGGGGCGGG GGAGCGCTGG GGCGGGAGGA 7G7GGGGGG3 CTCGCCCo'jC 

"7 4 821 GGC7CG7CGA 7GGGGAGCCA GCAGGCGAGG GAGAGGG7G7 GGTGGG7GGA GAGGAGGGGT 
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Those of skill in the art will recognize that, due to the degenerate nature of the 



genetic code, a variety of DNA compounds differing in their nucleotide sequences can be
used to encode a given amino acid sequence of the invention. The native DNA sequence
encoding the FK-520 PKS of Streptomyces hygroscopicus is shown herein merely to
illustrate a preferred embodiment of the invention, and the present invention includes DNA
compounds of any sequence that encode the amino acid sequences of the polypeptides and
proteins of the invention. In similar fashion, a polypeptide can typically tolerate one or more
amino acid substitutions, deletions, and insertions in its amino acid sequence without loss or
significant loss of a desired activity. The present invention includes such polypeptides with
alternate amino acid sequences, and the amino acid sequences shown merely illustrate
preferred embodiments of the invention.
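
To make the degeneracy point concrete, the number of distinct DNA sequences that encode a given peptide is the product of the number of synonymous codons for each residue. The sketch below assumes the standard genetic code; the function name `count_encodings` and the example peptides are illustrative and do not come from the sequences of the invention.

```python
from itertools import product
from collections import Counter

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
SYNONYMS = Counter(CODON_TABLE.values())  # number of codons per amino acid

def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences encoding the peptide (one-letter codes)."""
    total = 1
    for aa in peptide:
        if aa not in SYNONYMS:
            raise ValueError(f"unknown amino acid code: {aa!r}")
        total *= SYNONYMS[aa]
    return total

print(count_encodings("MW"))   # Met and Trp each have a single codon
print(count_encodings("LSR"))  # Leu, Ser, and Arg each have six codons
```

Because the count multiplies at every position, even short peptides can be encoded by a very large number of DNA sequences, which is why the native sequence is presented only as one embodiment.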

The recombinant nucleic acids, proteins, and peptides of the invention are many and
diverse. To facilitate an understanding of the invention and the diverse compounds and
methods provided thereby, the following general description of the FK-520 PKS genes and
modules of the PKS proteins encoded thereby is provided. This general description is
followed by a more detailed description of the various domains and modules of the FK-520
PKS contained in and encoded by the compounds of the invention. In this description,
reference to a heterologous PKS refers to any PKS other than the FK-520 PKS. Unless
otherwise indicated, reference to a PKS includes reference to a portion of a PKS. Moreover,
reference to a domain, module, or PKS includes reference to the nucleic acids encoding the
same and vice-versa, because the methods and reagents of the invention provide or enable
one to prepare proteins and the nucleic acids that encode them.

The FK-520 PKS is composed of three proteins encoded by three genes designated
fkbA, fkbB, and fkbC. The fkbA ORF encodes extender modules 7 - 10 of the PKS. The fkbB
ORF encodes the loading module (the CoA ligase) and extender modules 1 - 4 of the PKS.
The fkbC ORF encodes extender modules 5 - 6 of the PKS. The fkbP ORF encodes the
NRPS that attaches the pipecolic acid and cyclizes the FK-520 polyketide.
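
The assignment of modules to open reading frames described above can be captured in a small lookup structure. This is only a bookkeeping sketch; the dictionary layout and the helper name `orf_for_module` are hypothetical.

```python
# Module assignments as stated in the paragraph above.
ORF_MODULES = {
    "fkbB": ["loading", 1, 2, 3, 4],
    "fkbC": [5, 6],
    "fkbA": [7, 8, 9, 10],
}

def orf_for_module(module) -> str:
    """Return the ORF that encodes a given module ('loading' or an extender number)."""
    for orf, modules in ORF_MODULES.items():
        if module in modules:
            return orf
    raise KeyError(f"no ORF encodes module {module!r}")

# Every extender module 1-10 should be assigned to exactly one ORF.
extenders = sorted(m for mods in ORF_MODULES.values() for m in mods if isinstance(m, int))
print(extenders == list(range(1, 11)))
print(orf_for_module(6), orf_for_module("loading"))
```

The fkbP ORF is left out of the mapping because it encodes the NRPS rather than a PKS module.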

The loading module of the FK-520 PKS includes a CoA ligase, an ER domain, and
an ACP domain. The starter building block or unit for FK-520 is believed to be a
dihydroxycyclohexene carboxylic acid, which is derived from shikimate. The recombinant
DNA compounds of the invention that encode the loading module of the FK-520 PKS and
the corresponding polypeptides encoded thereby are useful for a variety of methods and in a
variety of compounds. In one embodiment, a DNA compound comprising a sequence that
encodes the FK-520 loading module is inserted into a DNA compound that comprises the
coding sequence for a heterologous PKS. The resulting construct, in which the coding
sequence for the loading module of the heterologous PKS is replaced by the coding
sequence for the FK-520 loading module, provides a novel PKS coding sequence. Examples
of heterologous PKS coding sequences include the rapamycin, FK-506, rifamycin, and
avermectin PKS coding sequences. In another embodiment, a DNA compound comprising a
sequence that encodes the FK-520 loading module is inserted into a DNA compound that
comprises the coding sequence for the FK-520 PKS or a recombinant FK-520 PKS that
produces an FK-520 derivative.

In another embodiment, a portion of the loading module coding sequence is utilized
in conjunction with a heterologous coding sequence. In this embodiment, the invention
provides, for example, either replacing the CoA ligase with a different CoA ligase, deleting
the ER, or replacing the ER with a different ER. In addition, or alternatively, the ACP can
be replaced by another ACP. In similar fashion, the corresponding domains in another
loading or extender module can be replaced by one or more domains of the FK-520 PKS.
The resulting heterologous loading module coding sequence can be utilized in conjunction
with a coding sequence for a PKS that synthesizes FK-520, an FK-520 derivative, or
another polyketide.

The first extender module of the FK-520 PKS includes a KS domain, an AT domain
specific for methylmalonyl CoA, a DH domain, a KR domain, and an ACP domain. The
recombinant DNA compounds of the invention that encode the first extender module of the
FK-520 PKS and the corresponding polypeptides encoded thereby are useful for a variety of
applications. In one embodiment, a DNA compound comprising a sequence that encodes the
FK-520 first extender module is inserted into a DNA compound that comprises the coding
sequence for a heterologous PKS. The resulting construct, in which the coding sequence for
a module of the heterologous PKS is either replaced by that for the first extender module of
the FK-520 PKS or the latter is merely added to coding sequences for modules of the
heterologous PKS, provides a novel PKS coding sequence. In another embodiment, a DNA
compound comprising a sequence that encodes the first extender module of the FK-520
PKS is inserted into a DNA compound that comprises the remainder of the coding sequence
for the FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520 derivative.

In another embodiment, all or only a portion of the first extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid
module. In this embodiment, the invention provides, for example, either replacing the
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting either the DH or KR or both; replacing the DH
or KR or both with another DH or KR; and/or inserting an ER. In replacing or inserting KR,
DH, and ER domains, it is often beneficial to replace the existing KR, DH, and ER domains
with the complete set of domains desired from another module. Thus, if one desires to insert
an ER domain, one may simply replace the existing KR and DH domains with a KR, DH,
and ER set of domains from a module containing such domains. In addition, the KS and/or
ACP can be replaced with another KS and/or ACP. In each of these replacements or
insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can originate
from a coding sequence for another module of the FK-520 PKS, from a gene for a PKS that
produces a polyketide other than FK-520, or from chemical synthesis. The resulting
heterologous first extender module coding sequence can be utilized in conjunction with a
coding sequence for a PKS that synthesizes FK-520, an FK-520 derivative, or another
polyketide. In similar fashion, the corresponding domains in a module of a heterologous
PKS can be replaced by one or more domains of the first extender module of the FK-520
PKS.
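
The domain replacements and deletions described for the first extender module can be pictured as operations on an ordered list of domains. The sketch below is a purely illustrative bookkeeping model, not a description of how the DNA constructs are made; the class `Domain` and the helpers `swap_at` and `delete_domains` are hypothetical names.

```python
from dataclasses import dataclass, replace
from typing import Optional, Tuple

@dataclass(frozen=True)
class Domain:
    kind: str                        # "KS", "AT", "DH", "KR", "ER", or "ACP"
    substrate: Optional[str] = None  # extender-unit specificity; only meaningful for AT
    active: bool = True

# First extender module as described in the text: KS, AT (methylmalonyl CoA), DH, KR, ACP.
MODULE_1: Tuple[Domain, ...] = (
    Domain("KS"), Domain("AT", "methylmalonyl CoA"), Domain("DH"), Domain("KR"), Domain("ACP"),
)

def swap_at(module: Tuple[Domain, ...], substrate: str) -> Tuple[Domain, ...]:
    """Return a hybrid module whose AT domain has a different substrate specificity."""
    if not any(d.kind == "AT" for d in module):
        raise ValueError("module has no AT domain")
    return tuple(replace(d, substrate=substrate) if d.kind == "AT" else d for d in module)

def delete_domains(module: Tuple[Domain, ...], kinds: set) -> Tuple[Domain, ...]:
    """Return a hybrid module with the named domain kinds removed."""
    return tuple(d for d in module if d.kind not in kinds)

hybrid = delete_domains(swap_at(MODULE_1, "malonyl CoA"), {"DH", "KR"})
print([d.kind for d in hybrid], hybrid[1].substrate)
```

Keeping the modules immutable means each hybrid is a new object, so the native module definition stays available for comparison.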

In an illustrative embodiment of this aspect of the invention, the invention provides
recombinant PKSs and recombinant DNA compounds and vectors that encode such PKSs in
which the KS domain of the first extender module has been inactivated. Such constructs are
especially useful when placed in translational reading frame with the remaining modules
and domains of an FK-520 or FK-520 derivative PKS. The utility of these constructs is that
host cells expressing, or cell free extracts containing, the PKS encoded thereby can be fed or
supplied with N-acylcysteamine thioesters of novel precursor molecules to prepare FK-520
derivatives. See U.S. patent application Serial No. 60/117,384, filed 27 Jan. 1999, and PCT
patent publication Nos. US97/02358 and US99/03986, each of which is incorporated herein
by reference.

The second extender module of the FK-520 PKS includes a KS, an AT specific for
methylmalonyl CoA, a KR, an inactive DH, and an ACP. The recombinant DNA
compounds of the invention that encode the second extender module of the FK-520 PKS
and the corresponding polypeptides encoded thereby are useful for a variety of applications.
In one embodiment, a DNA compound comprising a sequence that encodes the FK-520
second extender module is inserted into a DNA compound that comprises the coding
sequence for a heterologous PKS. The resulting construct, in which the coding sequence for
a module of the heterologous PKS is either replaced by that for the second extender module
of the FK-520 PKS or the latter is merely added to coding sequences for the modules of the
heterologous PKS, provides a novel PKS coding sequence. In another embodiment, a DNA
compound comprising a sequence that encodes the second extender module of the FK-520
PKS is inserted into a DNA compound that comprises the coding sequence for the
remainder of the FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520
derivative.

In another embodiment, all or a portion of the second extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid
module. In this embodiment, the invention provides, for example, either replacing the
methylmalonyl CoA specific AT with a malonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; deleting the KR and/or the inactive DH; replacing the
KR with another KR; and/or inserting an active DH or an active DH and an ER. In addition,
the KS and/or ACP can be replaced with another KS and/or ACP. In each of these
replacements or insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding
sequence can originate from a coding sequence for another module of the FK-520 PKS,
from a coding sequence for a PKS that produces a polyketide other than FK-520, or from
chemical synthesis. The resulting heterologous second extender module coding sequence
can be utilized in conjunction with a coding sequence from a PKS that synthesizes FK-520,
an FK-520 derivative, or another polyketide. In similar fashion, the corresponding domains
in a module of a heterologous PKS can be replaced by one or more domains of the second
extender module of the FK-520 PKS.

The third extender module of the FK-520 PKS includes a KS, an AT specific for
malonyl CoA, a KR, an inactive DH, and an ACP. The recombinant DNA compounds of the
invention that encode the third extender module of the FK-520 PKS and the corresponding
polypeptides encoded thereby are useful for a variety of applications. In one embodiment, a
DNA compound comprising a sequence that encodes the FK-520 third extender module is
inserted into a DNA compound that comprises the coding sequence for a heterologous PKS.
The resulting construct, in which the coding sequence for a module of the heterologous PKS
is either replaced by that for the third extender module of the FK-520 PKS or the latter is
merely added to coding sequences for the modules of the heterologous PKS, provides a
novel PKS coding sequence. In another embodiment, a DNA compound comprising a
sequence that encodes the third extender module of the FK-520 PKS is inserted into a DNA
compound that comprises the coding sequence for the remainder of the FK-520 PKS or a
recombinant FK-520 PKS that produces an FK-520 derivative.

In another embodiment, all or a portion of the third extender module coding 
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid 
module. In this embodiment, the invention provides, for example, either replacing the
malonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or 2-hydroxymalonyl
CoA specific AT; deleting the KR and/or the inactive DH; replacing the KR with another
KR; and/or inserting an active DH or an active DH and an ER. In addition, the KS and/or
ACP can be replaced with another KS and/or ACP. In each of these replacements or
insertions, the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can originate
from a coding sequence for another module of the FK-520 PKS, from a coding sequence for a
PKS that produces a polyketide other than FK-520, or from chemical synthesis. The
resulting heterologous third extender module coding sequence can be utilized in
conjunction with a coding sequence from a PKS that synthesizes FK-520, an FK-520
derivative, or another polyketide. In similar fashion, the corresponding domains in a
module of a heterologous PKS can be replaced by one or more domains of the third extender
module of the FK-520 PKS.

The fourth extender module of the FK-520 PKS includes a KS, an AT that binds 
ethylmalonyl CoA, an inactive DH, and an ACP. The recombinant DNA compounds of the
invention that encode the fourth extender module of the FK-520 PKS and the corresponding
polypeptides encoded thereby are useful for a variety of applications. In one embodiment, a
DNA compound comprising a sequence that encodes the FK-520 fourth extender module is
inserted into a DNA compound that comprises the coding sequence for a heterologous PKS.
The resulting construct, in which the coding sequence for a module of the heterologous PKS 
is either replaced by that for the fourth extender module of the FK-520 PKS or the latter is 
merely added to coding sequences for the modules of the heterologous PKS, provides a 
novel PKS coding sequence. In another embodiment, a DNA compound comprising a 
sequence that encodes the fourth extender module of the FK-520 PKS is inserted into a 
DNA compound that comprises the remainder of the coding sequence for the FK-520 PKS 
or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion of the fourth extender module coding sequence is 
utilized in conjunction with other PKS coding sequences to create a hybrid module. In this 
embodiment, the invention provides, for example, either replacing the ethylmalonyl CoA 
specific AT with a malonyl CoA, methylmalonyl CoA, or 2-hydroxymalonyl CoA specific
AT; deleting the inactive DH; and/or inserting a KR, a KR and an active DH, or a KR, an
active DH, and an ER. In addition, the KS and/or ACP can be replaced with another KS
and/or ACP. In each of these replacements or insertions, the heterologous KS, AT, DH, KR,
ER, or ACP coding sequence can originate from a coding sequence for another module of 
the FK-520 PKS, a PKS for a polyketide other than FK-520, or from chemical synthesis. 
The resulting heterologous fourth extender module coding sequence can be utilized in 
conjunction with a coding sequence for a PKS that synthesizes FK-520, an FK-520 
derivative, or another polyketide. In similar fashion, the corresponding domains in a module 
of a heterologous PKS can be replaced by one or more domains of the fourth extender 
module of the FK-520 PKS. 

As illustrative examples, the present invention provides recombinant genes, vectors, 
and host cells that result from the conversion of the FK-506 PKS to an FK-520 PKS and 
vice-versa. In one embodiment, the invention provides a recombinant set of FK-506 PKS
genes but in which the coding sequences for the fourth extender module or at least those for
the AT domain in the fourth extender module have been replaced by those for the AT
domain of the fourth extender module of the FK-520 PKS. This recombinant PKS can be
used to produce FK-520 in recombinant host cells. In another embodiment, the invention
provides a recombinant set of FK-520 PKS genes but in which the coding sequences for the
fourth extender module or at least those for the AT domain in the fourth extender module
have been replaced by those for the AT domain of the fourth extender module of the FK-506
PKS. This recombinant PKS can be used to produce FK-506 in recombinant host cells.
Other examples of hybrid PKS enzymes of the invention include those in which the AT
domain of module 4 has been replaced with a malonyl specific AT domain to provide a PKS
that produces 21-desethyl-FK520 or with a methylmalonyl specific AT domain to provide a
PKS that produces 21-desethyl-21-methyl-FK520. Another hybrid PKS of the invention is
prepared by replacing the AT and inactive KR domain of FK-520 extender module 4 with a
methylmalonyl specific AT and an active KR domain, such as, for example, from module 2
of the DEBS or oleandolide PKS enzymes, to produce
21-desethyl-21-methyl-22-desoxo-22-hydroxy-FK520. The compounds produced by these hybrid PKS
enzymes are neurotrophic. 

The fifth extender module of the FK-520 PKS includes a KS, an AT that binds 
methylmalonyl CoA, a DH, a KR, and an ACP. The recombinant DNA compounds of the
invention that encode the fifth extender module of the FK-520 PKS and the corresponding
polypeptides encoded thereby are useful for a variety of applications. In one embodiment, a
DNA compound comprising a sequence that encodes the FK-520 fifth extender module is
inserted into a DNA compound that comprises the coding sequence for a heterologous PKS.
The resulting construct, in which the coding sequence for a module of the heterologous PKS
is either replaced by that for the fifth extender module of the FK-520 PKS or the latter is
merely added to coding sequences for the modules of the heterologous PKS, provides a
novel PKS. In another embodiment, a DNA compound comprising a sequence that encodes
the fifth extender module of the FK-520 PKS is inserted into a DNA compound that
comprises the coding sequence for the FK-520 PKS or a recombinant FK-520 PKS that
produces an FK-520 derivative.

In another embodiment, a portion of the fifth extender module coding sequence is 
utilized in conjunction with other PKS coding sequences to create a hybrid module. In this 
embodiment, the invention provides, for example, either replacing the methylmalonyl CoA
specific AT with a malonyl CoA, ethylmalonyl CoA, or 2-hydroxymalonyl CoA specific
AT; deleting any one or both of the DH and KR; replacing any one or both of the DH and
KR with another KR and/or DH; and/or inserting an ER. In addition, the KS and/or ACP
can be replaced with another KS and/or ACP. In each of these replacements or insertions,
the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can originate from a
coding sequence for another module of the FK-520 PKS, from a coding sequence for a PKS
that produces a polyketide other than FK-520, or from chemical synthesis. The resulting
heterologous fifth extender module coding sequence can be utilized in conjunction with a
coding sequence for a PKS that synthesizes FK-520, an FK-520 derivative, or another
polyketide. In similar fashion, the corresponding domains in a module of a heterologous 
PKS can be replaced by one or more domains of the fifth extender module of the FK-520 
PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant
FK-520 PKS genes in which the coding sequences for the DH domain of the fifth extender 
module have been deleted or mutated to render the DH non-functional. In one such mutated 
gene, the KR and DH coding sequences are replaced with those encoding only a KR domain 
from another PKS gene. The resulting PKS genes code for the expression of an FK-520 
PKS that produces an FK-520 analog that lacks the C-19 to C-20 double bond of FK-520 
and has a C-20 hydroxyl group. Such analogs are preferred neurotrophins, because they 
have little or no immunosuppressant activity. This recombinant fifth extender module 
coding sequence can be combined with other coding sequences to make additional 
compounds of the invention. In an illustrative embodiment, the present invention provides a 
recombinant FK-520 PKS that contains both this fifth extender module and the recombinant 
fourth extender module described above that comprises the coding sequence for the fourth 
extender module AT domain of the FK-506 PKS. The invention also provides recombinant 
host cells derived from FK-506 producing host cells that have been mutated to prevent 
production of FK-506 but that express this recombinant PKS and so synthesize the 
corresponding (lacking the C-19 to C-20 double bond of FK-506 and having a C-20
hydroxyl group) FK-506 derivative. In another embodiment, the present invention provides
a recombinant FK-506 PKS in which the DH domain of module 5 has been deleted or 
otherwise rendered inactive and thus produces this novel polyketide. 
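
The effect of removing reductive domains, as in the fifth extender module example above, follows the usual order in which the reductive loop acts: a KR reduces the beta-keto group to a hydroxyl, a DH dehydrates the hydroxyl to a double bond, and an ER reduces the double bond. A minimal Python sketch of that rule is given here for illustration only; the function name `beta_carbon_state` and its return labels are hypothetical and are not taken from the specification.

```python
from typing import Iterable


def beta_carbon_state(active_domains: Iterable[str]) -> str:
    """Predict what is left at the beta-carbon given a module's active reductive domains.

    Assumes the domains act in the order KR -> DH -> ER, so a later domain
    has no effect when an earlier one is missing or inactive.
    """
    domains = set(active_domains)
    if "KR" not in domains:
        return "keto"          # nothing reduces the beta-keto group
    if "DH" not in domains:
        return "hydroxyl"      # KR only: reduction stops at the alcohol
    if "ER" not in domains:
        return "double bond"   # KR + DH: dehydration gives an alkene
    return "methylene"         # KR + DH + ER: fully reduced


# Fifth extender module as described: KR and DH present, then DH removed.
print(beta_carbon_state({"KR", "DH"}))
print(beta_carbon_state({"KR"}))
```

With the native fifth-module domains (KR and DH) the sketch reports a double bond; with the DH removed it reports a hydroxyl, which corresponds to the C-20 hydroxyl analog described above.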

The sixth extender module of the FK-520 PKS includes a KS, an AT specific for 
methylmalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA compounds
of the invention that encode the sixth extender module of the FK-520 PKS and the
corresponding polypeptides encoded thereby are useful for a variety of applications. In one
embodiment, a DNA compound comprising a sequence that encodes the FK-520 sixth
extender module is inserted into a DNA compound that comprises the coding sequence for a
heterologous PKS. The resulting construct, in which the coding sequence for a module of
the heterologous PKS is either replaced by that for the sixth extender module of the FK-520
PKS or the latter is merely added to coding sequences for the modules of the heterologous
PKS, provides a novel PKS coding sequence. In another embodiment, a DNA compound
comprising a sequence that encodes the sixth extender module of the FK-520 PKS is
inserted into a DNA compound that comprises the coding sequence for the remainder of the
FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion of the sixth extender module coding sequence is 
utilized in conjunction with other PKS coding sequences to create a hybrid module. In this 
embodiment, the invention provides, for example, either replacing the methylmalonyl CoA
specific AT with a malonyl CoA, ethylmalonyl CoA, or 2-hydroxymalonyl CoA specific
AT; deleting any one, two, or all three of the KR, DH, and ER; and/or replacing any one,
two, or all three of the KR, DH, and ER with another KR, DH, and/or ER. In addition, the KS
and/or ACP can be replaced with another KS and/or ACP. In each of these replacements,
the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can originate from a
coding sequence for another module of the FK-520 PKS, from a coding sequence for a PKS
that produces a polyketide other than FK-520, or from chemical synthesis. The resulting
heterologous sixth extender module coding sequence can be utilized in conjunction with a
coding sequence for a PKS that synthesizes FK-520, an FK-520 derivative, or another
polyketide. In similar fashion, the corresponding domains in a module of a heterologous
PKS can be replaced by one or more domains of the sixth extender module of the FK-520
PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant 
FK-520 PKS genes in which the coding sequences for the DH and ER domains of the sixth 
extender module have been deleted or mutated to render them non-functional. In one such
mutated gene, the KR, ER, and DH coding sequences are replaced with those encoding only
a KR domain from another PKS gene. This can also be accomplished by simply replacing
the coding sequences for extender module six with those for an extender module having a
methylmalonyl specific AT and only a KR domain from a heterologous PKS gene, such as,
for example, the coding sequences for extender module two encoded by the eryAI gene. The
resulting PKS genes code for the expression of an FK-520 PKS that produces an FK-520
analog that has a C-18 hydroxyl group. Such analogs are preferred neurotrophins, because
they have little or no immunosuppressant activity. This recombinant sixth extender module
coding sequence can be combined with other coding sequences to make additional
compounds of the invention. In an illustrative embodiment, the present invention provides a
recombinant FK-520 PKS that contains both this sixth extender module and the
recombinant fourth extender module described above that comprises the coding sequence
for the fourth extender module AT domain of the FK-506 PKS. The invention also provides
recombinant host cells derived from FK-506 producing host cells that have been mutated to
prevent production of FK-506 but that express this recombinant PKS and so synthesize the
corresponding (having a C-18 hydroxyl group) FK-506 derivative. In another embodiment,
the present invention provides a recombinant FK-506 PKS in which the DH and ER
domains of module 6 have been deleted or otherwise rendered inactive and thus produces 
this novel polyketide. 

The seventh extender module of the FK-520 PKS includes a KS, an AT specific for 
2-hydroxymalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA 
compounds of the invention that encode the seventh extender module of the FK-520 PKS 
and the corresponding polypeptides encoded thereby are useful for a variety of applications. 
In one embodiment, a DNA compound comprising a sequence that encodes the FK-520
seventh extender module is inserted into a DNA compound that comprises the coding 
sequence for a heterologous PKS. The resulting construct, in which the coding sequence for 
a module of the heterologous PKS is either replaced by that for the seventh extender module 
of the FK-520 PKS or the latter is merely added to coding sequences for the modules of the 
heterologous PKS, provides a novel PKS coding sequence. In another embodiment, a DNA 
compound comprising a sequence that encodes the seventh extender module of the FK-520
PKS is inserted into a DNA compound that comprises the coding sequence for the
remainder of the FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520 
derivative. 

In another embodiment, a portion or all of the seventh extender module coding 
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid 
module. In this embodiment, the invention provides, for example, either replacing the
2-hydroxymalonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or
malonyl CoA specific AT; deleting the KR, the DH, and/or the ER; and/or replacing the
KR, DH, and/or ER. In addition, the KS and/or ACP can be replaced with another KS
and/or ACP. In each of these replacements or insertions, the heterologous KS, AT, DH, KR,
ER, or ACP coding sequence can originate from a coding sequence for another module of
the FK-520 PKS, from a coding sequence for a PKS that produces a polyketide other than
FK-520, or from chemical synthesis. The resulting heterologous seventh extender module
coding sequence can be utilized in conjunction with a coding sequence for a PKS that
synthesizes FK-520, an FK-520 derivative, or another polyketide. In similar fashion, the
corresponding domains in a module of a heterologous PKS can be replaced by one or more
domains of the seventh extender module of the FK-520 PKS.

In an illustrative embodiment, the present invention provides a set of recombinant 
FK-520 PKS genes in which the coding sequences for the AT domain of the seventh
extender module have been replaced with those encoding an AT domain for malonyl,
methylmalonyl, or ethylmalonyl CoA from another PKS gene. The resulting PKS genes
code for the expression of an FK-520 PKS that produces an FK-520 analog that lacks the
C-15 methoxy group, having instead a hydrogen, methyl, or ethyl group at that position,
respectively. Such analogs are preferred, because they are more slowly metabolized than
FK-520. This recombinant seventh extender module coding sequence can be combined with
other coding sequences to make additional compounds of the invention. In an illustrative
embodiment, the present invention provides a recombinant FK-520 PKS that contains both
this seventh extender module and the recombinant fourth extender module described above
that comprises the coding sequence for the fourth extender module AT domain of the
FK-506 PKS. The invention also provides recombinant host cells derived from FK-506
producing host cells that have been mutated to prevent production of FK-506 but that
express this recombinant PKS and so synthesize the corresponding (C-15-desmethoxy)
FK-506 derivative. In another embodiment, the present invention provides a recombinant
FK-506 PKS in which the AT domain of module 7 has been replaced and thus produces this
novel polyketide. 
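
The relationship between the AT domain chosen for the seventh extender module and the group found at C-15 can be written as a simple lookup. The Python sketch below is illustrative only: the names `C15_GROUP_BY_AT` and `c15_group` are hypothetical, and the table contains only the pairings stated in the text, with the native 2-hydroxymalonyl CoA specific AT taken to correspond to the C-15 methoxy group of FK-520.

```python
# Group at C-15 for each AT specificity in extender module 7, as stated in the text.
C15_GROUP_BY_AT = {
    "2-hydroxymalonyl": "methoxy",  # native FK-520 module 7 AT (assumed pairing)
    "malonyl": "hydrogen",
    "methylmalonyl": "methyl",
    "ethylmalonyl": "ethyl",
}


def c15_group(at_specificity: str) -> str:
    """Return the C-15 group expected for the given module 7 AT specificity."""
    try:
        return C15_GROUP_BY_AT[at_specificity]
    except KeyError:
        # The text states no outcome for other extender units, so refuse to guess.
        raise ValueError(
            f"no C-15 outcome stated for AT specificity {at_specificity!r}"
        ) from None


for at in ("malonyl", "methylmalonyl", "ethylmalonyl"):
    print(f"module 7 AT -> {at}: C-15 {c15_group(at)}")
```

The lookup deliberately raises an error for any specificity the text does not mention, rather than extrapolating.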

In another illustrative embodiment, the present invention provides a hybrid PKS in 
which the AT and KR domains of module 7 of the FK-520 PKS are replaced by a
methylmalonyl specific AT domain and an inactive KR domain, such as, for example, the
AT and KR domains of extender module 6 of the rapamycin PKS. The resulting hybrid PKS
produces 15-desmethoxy-15-methyl-16-oxo-FK-520, a neurotrophin compound.

The eighth extender module of the FK-520 PKS includes a KS, an AT specific for
2-hydroxymalonyl CoA, a KR, and an ACP. The recombinant DNA compounds of the
invention that encode the eighth extender module of the FK-520 PKS and the corresponding
polypeptides encoded thereby are useful for a variety of applications. In one embodiment, a
DNA compound comprising a sequence that encodes the FK-520 eighth extender module is
inserted into a DNA compound that comprises the coding sequence for a heterologous PKS.
The resulting construct, in which the coding sequence for a module of the heterologous PKS
is either replaced by that for the eighth extender module of the FK-520 PKS or the latter is
merely added to coding sequences for the modules of the heterologous PKS, provides a
novel PKS coding sequence. In another embodiment, a DNA compound comprising a
sequence that encodes the eighth extender module of the FK-520 PKS is inserted into a 
DNA compound that comprises the coding sequence for the remainder of the FK-520 PKS 
or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion of the eighth extender module coding sequence is
utilized in conjunction with other PKS coding sequences to create a hybrid module. In this
embodiment, the invention provides, for example, either replacing the 2-hydroxymalonyl
CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or malonyl CoA specific
AT; deleting or replacing the KR; and/or inserting a DH or a DH and an ER. In addition, the
KS and/or ACP can be replaced with another KS and/or ACP. In each of these
replacements, the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can
originate from a coding sequence for another module of the FK-520 PKS, from a coding
sequence for a PKS that produces a polyketide other than FK-520, or from chemical
synthesis. The resulting heterologous eighth extender module coding sequence can be
utilized in conjunction with a coding sequence for a PKS that synthesizes FK-520, an
FK-520 derivative, or another polyketide. In similar fashion, the corresponding domains in
a module of a heterologous PKS can be replaced by one or more domains of the eighth
extender module of
the FK-520 PKS. 

In an illustrative embodiment, the present invention provides a set of recombinant
FK-520 PKS genes in which the coding sequences for the AT domain of the eighth extender
module have been replaced with those encoding an AT domain for malonyl, methylmalonyl,
or ethylmalonyl CoA from another PKS gene. The resulting PKS genes code for the
expression of an FK-520 PKS that produces an FK-520 analog that lacks the C-13 methoxy
group, having instead a hydrogen, methyl, or ethyl group at that position, respectively. Such
analogs are preferred, because they are more slowly metabolized than FK-520. This
recombinant eighth extender module coding sequence can be combined with other coding
sequences to make additional compounds of the invention. In an illustrative embodiment,
the present invention provides a recombinant FK-520 PKS that contains both this eighth
extender module and the recombinant fourth extender module described above that
comprises the coding sequence for the fourth extender module AT domain of the FK-506
PKS. The invention also provides recombinant host cells derived from FK-506 producing
host cells that have been mutated to prevent production of FK-506 but that express this
recombinant PKS and so synthesize the corresponding (C-13-desmethoxy) FK-506
derivative. In another embodiment, the present invention provides a recombinant FK-506
PKS in which the AT domain of module 8 has been replaced and thus produces this novel
polyketide. 

The ninth extender module of the FK-520 PKS includes a KS, an AT specific for 
methylmalonyl CoA, a KR, a DH, an ER, and an ACP. The recombinant DNA compounds
of the invention that encode the ninth extender module of the FK-520 PKS and the 
corresponding polypeptides encoded thereby are useful for a variety of applications. In one 
embodiment, a DNA compound comprising a sequence that encodes the FK-520 ninth 
extender module is inserted into a DNA compound that comprises the coding sequence for a 
heterologous PKS. The resulting construct, in which the coding sequence for a module of 
the heterologous PKS is either replaced by that for the ninth extender module of the FK-520 
PKS or the latter is merely added to coding sequences for the modules of the heterologous 
PKS, provides a novel PKS coding sequence. In another embodiment, a DNA compound 
comprising a sequence that encodes the ninth extender module of the FK-520 PKS is 
inserted into a DNA compound that comprises the coding sequence for the remainder of the 
FK-520 PKS or a recombinant FK-520 PKS that produces an FK-520 derivative. 

In another embodiment, a portion of the ninth extender module coding sequence is 
utilized in conjunction with other PKS coding sequences to create a hybrid module. In this 
embodiment, the invention provides, for example, either replacing the methylmalonyl CoA
specific AT with a malonyl CoA, ethylmalonyl CoA, or 2-hydroxymalonyl CoA specific
AT; deleting any one, two, or all three of the KR, DH, and ER; and/or replacing any one,
two, or all three of the KR, DH, and ER with another KR, DH, and/or ER. In addition, the
KS and/or ACP can be replaced with another KS and/or ACP. In each of these
replacements, the heterologous KS, AT, DH, KR, ER, or ACP coding sequence can
originate from a coding sequence for another module of the FK-520 PKS, from a coding
sequence for a PKS that produces a polyketide other than FK-520, or from chemical
synthesis. The resulting heterologous ninth extender module coding sequence can be
utilized in conjunction with a coding sequence for a PKS that synthesizes FK-520, an
FK-520 derivative, or another polyketide. In similar fashion, the corresponding domains in
a module of a heterologous PKS can be replaced by one or more domains of the ninth
extender module of
the FK-520 PKS. 

The tenth extender module of the FK-520 PKS includes a KS, an AT specific for 
malonyl CoA, and an ACP. The recombinant DNA compounds of the invention that encode
the tenth extender module of the FK-520 PKS and the corresponding polypeptides encoded
thereby are useful for a variety of applications. In one embodiment, a DNA compound
comprising a sequence that encodes the FK-520 tenth extender module is inserted into a
DNA compound that comprises the coding sequence for a heterologous PKS. The resulting
construct, in which the coding sequence for a module of the heterologous PKS is either
replaced by that for the tenth extender module of the FK-520 PKS or the latter is merely
added to coding sequences for the modules of the heterologous PKS, provides a novel PKS
coding sequence. In another embodiment, a DNA compound comprising a sequence that
encodes the tenth extender module of the FK-520 PKS is inserted into a DNA compound
that comprises the coding sequence for the remainder of the FK-520 PKS or a recombinant
FK-520 PKS that produces an FK-520 derivative.

In another embodiment, a portion or all of the tenth extender module coding
sequence is utilized in conjunction with other PKS coding sequences to create a hybrid
module. In this embodiment, the invention provides, for example, either replacing the
malonyl CoA specific AT with a methylmalonyl CoA, ethylmalonyl CoA, or
2-hydroxymalonyl CoA specific AT; and/or inserting a KR, a KR and DH, or a KR, DH, and
an ER. In addition, the KS and/or ACP can be replaced with another KS and/or ACP. In
each of these replacements or insertions, the heterologous KS, AT, DH, KR, ER, or ACP
coding sequence can originate from a coding sequence for another module of the FK-520
PKS, from a coding sequence for a PKS that produces a polyketide other than FK-520, or
from chemical synthesis. The resulting heterologous tenth extender module coding sequence
can be utilized in conjunction with a coding sequence for a PKS that synthesizes FK-520, an
FK-520 derivative, or another polyketide. In similar fashion, the corresponding domains in a
module of a heterologous PKS can be replaced by one or more domains of the tenth
extender module of the FK-520 PKS.
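
The domain compositions recited above for extender modules three through ten can be collected into a small lookup table. The following Python sketch is illustrative only: the names `Module`, `FK520_MODULES`, `modules_with_at`, and `active_reductive` are hypothetical, and the table merely transcribes the domain lists stated in the text. KS and ACP are present in every module and are therefore implied; module ten is entered with no reductive domains, which is an assumption consistent with the proposal above to insert a KR, DH, and ER into it.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Module:
    """One extender module; KS and ACP are implied for every module."""
    at_substrate: str                # extender unit the AT domain selects
    reductive: Tuple[str, ...] = ()  # reductive domains, in the order the text lists them
    inactive: Tuple[str, ...] = ()   # domains the text describes as inactive


# Transcribed from the module descriptions in the text (modules 3 to 10 only).
FK520_MODULES: Dict[int, Module] = {
    3: Module("malonyl", ("KR", "DH"), inactive=("DH",)),
    4: Module("ethylmalonyl", ("DH",), inactive=("DH",)),
    5: Module("methylmalonyl", ("DH", "KR")),
    6: Module("methylmalonyl", ("KR", "DH", "ER")),
    7: Module("2-hydroxymalonyl", ("KR", "DH", "ER")),
    8: Module("2-hydroxymalonyl", ("KR",)),
    9: Module("methylmalonyl", ("KR", "DH", "ER")),
    10: Module("malonyl"),
}


def modules_with_at(substrate: str) -> List[int]:
    """Return the module numbers whose AT selects the given extender unit."""
    return sorted(n for n, m in FK520_MODULES.items() if m.at_substrate == substrate)


def active_reductive(n: int) -> Tuple[str, ...]:
    """Reductive domains of module n that the text does not mark as inactive."""
    m = FK520_MODULES[n]
    return tuple(d for d in m.reductive if d not in m.inactive)


print(modules_with_at("methylmalonyl"))
print(active_reductive(3))
```

A table of this kind makes it easy to see which modules are candidates for a given AT replacement, for example that the two 2-hydroxymalonyl CoA specific AT domains sit in modules seven and eight.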

The FK-520 polyketide precursor produced by the action of the tenth extender
module of the PKS is then attached to pipecolic acid and cyclized to form FK-520. The
enzyme FkbP is the NRPS-like enzyme that catalyzes these reactions. FkbP also includes a
thioesterase activity that cleaves the nascent FK-520 polyketide from the NRPS. The
present invention provides recombinant DNA compounds that encode the fkbP gene and so
provides recombinant methods for expressing the fkbP gene product in recombinant host
cells. The recombinant fkbP genes of the invention include those in which the coding
sequence for the adenylation domain has been mutated or replaced with coding sequences
from other NRPS-like enzymes so that the resulting recombinant FkbP incorporates a
moiety other than pipecolic acid. For the construction of host cells that do not naturally
produce pipecolic acid, the present invention provides recombinant DNA compounds that
express the enzymes that catalyze at least some of the biosynthesis of pipecolic acid (see
Nielsen et al., 1991, Biochem. 30: 5789-96). The fkbL gene encodes a homolog of RapL, a
lysine cyclodeaminase responsible in part for producing the pipecolate unit added to the end
of the polyketide chain. The fkbP and fkbL recombinant genes of the invention can be used
in heterologous hosts to produce compounds such as FK-520 or, in conjunction with other
PKS or NRPS genes, to produce known or novel polyketides and non-ribosomal peptides.

The present invention also provides recombinant DNA compounds that encode the
P450 oxidase and methyltransferase genes involved in the biosynthesis of FK-520. Figure 2
shows the various sites on the FK-520 polyketide core structure at which these enzymes act.
By providing these genes in recombinant form, the present invention provides recombinant
host cells that can produce FK-520. This is accomplished by introducing the recombinant
PKS, P450 oxidase, and methyltransferase genes into a heterologous host cell. In a preferred
embodiment, the heterologous host cell is Streptomyces coelicolor CH999 or Streptomyces
lividans K4-114, as described in U.S. Patent No. 5,830,750 and U.S. patent application
Serial Nos. 08/828,898, filed 31 Mar. 1997, and 09/181,833, filed 28 Oct. 1998, each of
which is incorporated herein by reference. In addition, by providing recombinant host cells
that express only a subset of these genes, the present invention provides methods for making
FK-520 precursor compounds not readily obtainable by other means. 

In a related aspect, the present invention provides recombinant DNA compounds 
and vectors that are useful in generating, by homologous recombination, recombinant host
cells that produce FK-520 precursor compounds. In this aspect of the invention, a native
host cell that produces FK-520 is transformed with a vector (such as an SCP2* derived
vector for Streptomyces host cells) that encodes one or more disrupted genes (i.e., a
hydroxylase, a methyltransferase, or both) or merely flanking regions from those genes.
When the vector integrates by homologous recombination, the native, functional gene is
deleted or replaced by the non-functional recombinant gene, and the resulting host cell thus
produces an FK-520 precursor. Such host cells can also be complemented by introduction of
a modified form of the deleted or mutated non-functional gene to produce a novel
compound. 

In one important embodiment, the present invention provides a hybrid PKS and the 
corresponding recombinant DNA compounds that encode those hybrid PKS enzymes. For 
purposes of the present invention, a hybrid PKS is a recombinant PKS that comprises all or
part of one or more modules and thioesterase/cyclase domain of a first PKS and all or part
of one or more modules, loading module, and thioesterase/cyclase domain of a second PKS.
In one preferred embodiment, the first PKS is all or part of the FK-520 PKS, and the second
PKS is only a portion or all of a non-FK-520 PKS. 

One example of the preferred embodiment is an FK-520 PKS in which the AT 
domain of module 8, which specifies a hydroxymalonyl CoA and from which the C-13
methoxy group of FK-520 is derived, is replaced by an AT domain that specifies a malonyl,
methylmalonyl, or ethylmalonyl CoA. Examples of such replacement AT domains include
the AT domains from modules 3, 12, and 13 of the rapamycin PKS and from modules 1
and 2 of the erythromycin PKS. Such replacements, conducted at the level of the gene for 
the PKS, are illustrated in the examples below. Another illustrative example of such a 
hybrid PKS includes an FK-520 PKS in which the natural loading module has been replaced 
with a loading module of another PKS. Another example of such a hybrid PKS is an FK- 
520 PKS in which the AT domain of module three is replaced with an AT domain that binds 
methylmalonyl CoA. 

In another preferred embodiment, the first PKS is most but not all of a non-FK-520 
PKS, and the second PKS is only a portion or all of the FK-520 PKS. An illustrative 
example of such a hybrid PKS includes an erythromycin PKS in which an AT specific for 
methylmalonyl CoA is replaced with an AT from the FK-520 PKS specific for malonyl
CoA.

Those of skill in the art will recognize that all or part of either the first or second
PKS in a hybrid PKS of the invention need not be isolated from a naturally occurring
source. For example, only a small portion of an AT domain determines its specificity. See
U.S. provisional patent application Serial No. 60/091,526, incorporated herein by reference.
The state of the art in DNA synthesis allows the artisan to construct de novo DNA 
compounds of size sufficient to construct a useful portion of a PKS module or domain. For 
purposes of the present invention, such synthetic DNA compounds are deemed to be a 
portion of a PKS. 

Thus, the hybrid modules of the invention are incorporated into a PKS to provide a
hybrid PKS of the invention. A hybrid PKS of the invention can result not only:

(i) from fusions of heterologous domain (where heterologous means the domains in 
that module are from at least two different naturally occurring modules) coding sequences 
to produce a hybrid module coding sequence contained in a PKS gene whose product is 
incorporated into a PKS, 

but also: 

(ii) from fusions of heterologous module (where heterologous module means two 
modules are adjacent to one another that are not adjacent to one another in naturally 
occurring PKS enzymes) coding sequences to produce a hybrid coding sequence contained 
in a PKS gene whose product is incorporated into a PKS, 

(iii) from expression of one or more FK-520 PKS genes with one or more non-FK- 
520 PKS genes, including both naturally occurring and recombinant non-FK-520 PKS 
genes, and 

(iv) from combinations of the foregoing. 

Various hybrid PKSs of the invention illustrating these various alternatives are described 
herein. 

Examples of the production of a hybrid PKS by co-expression of PKS genes from
the FK-520 PKS and a non-FK-520 PKS include hybrid PKS enzymes produced by
coexpression of FK-520 and rapamycin PKS genes. Preferably, such hybrid PKS enzymes
are produced in recombinant Streptomyces host cells that produce FK-520 or FK-506 but
have been mutated to inactivate the gene whose function is to be replaced by the rapamycin
PKS gene introduced to produce the hybrid PKS. Particular examples include (i)
replacement of the fkbC gene with the rapB gene; and (ii) replacement of the fkbA gene with
the rapC gene. The latter hybrid PKS produces 13,15-didesmethoxy-FK-520, if the host cell
is an FK-520 producing host cell, and 13,15-didesmethoxy-FK-506, if the host cell is an
FK-506 producing host cell. The compounds produced by these hybrid PKS enzymes are
immunosuppressants and neurotrophins but can be readily modified to act only as
neurotrophins, as described in Example 6, below.

Other illustrative hybrid PKS enzymes of the invention are prepared by replacing the
fkbA gene of an FK-520 or FK-506 producing host cell with a hybrid fkbA gene in which:

(a) the extender module 8 through 10, inclusive, coding sequences have been replaced by
the coding sequences for extender modules 12 to 14, inclusive, of the rapamycin PKS; and

(b) the module 8 coding sequences have been replaced by the module 8 coding sequence of
the rifamycin PKS. When expressed with the other, naturally occurring FK-520 or FK-506
PKS genes and the genes of the modification enzymes, the resulting hybrid PKS enzymes
produce, respectively, (a) 13-desmethoxy-FK-520 or 13-desmethoxy-FK-506; and (b)
13-desmethoxy-13-methyl-FK-520 or 13-desmethoxy-13-methyl-FK-506. In a preferred
embodiment, these recombinant PKS genes of the invention are introduced into the
producing host cell by a vector such as pHU204, which is a plasmid pRM5 derivative that
has the well-characterized SCP2* replicon, the colE1 replicon, the tsr and bla resistance
genes, and a cos site. This vector can be used to introduce the recombinant fkbA
replacement gene into an FK-520 or FK-506 producing host cell (or a host cell derived
therefrom in which the endogenous fkbA gene has been rendered inactive by
mutation, deletion, or homologous recombination with the gene that replaces it) to produce
the desired hybrid PKS.

In constructing hybrid PKSs of the invention, certain general methods may be
helpful. For example, it is often beneficial to retain the framework of the module to be
altered to make the hybrid PKS. Thus, if one desires to add DH and ER functionalities to a
module, it is often preferred to replace the KR domain of the original module with a KR,
DH, and ER domain-containing segment from another module, instead of merely inserting
DH and ER domains. One can alter the stereochemical specificity of a module by
replacement of the KS domain with a KS domain from a module that specifies a different
stereochemistry. See Lau et al., 1999, "Dissecting the role of acyltransferase domains of
modular polyketide synthases in the choice and stereochemical fate of extender units,"
Biochemistry 38(5): 1643-1651, incorporated herein by reference. Stereochemistry can also
be changed by changing the KR domain. Also, one can alter the specificity of an AT
domain by changing only a small segment of the domain. See Lau et al., supra. One can
also take advantage of known linker regions in PKS proteins to link modules from two
different PKSs to create a hybrid PKS. See Gokhale et al., 16 Apr. 1999, "Dissecting and
Exploiting Intermodular Communication in Polyketide Synthases," Science 284: 482-485,
incorporated herein by reference.

The following Table lists references describing illustrative PKS genes and
corresponding enzymes that can be utilized in the construction of the recombinant PKSs and
the corresponding DNA compounds that encode them of the invention. Also presented are
various references describing tailoring enzymes and corresponding genes that can be
employed in accordance with the methods of the present invention.

Avermectin

U.S. Pat. No. 5,252,474 to Merck.

MacNeil et al., 1993, Industrial Microorganisms: Basic and Applied Molecular
Genetics, Baltz, Hegeman, & Skatrud, eds. (ASM), pp. 245-256, A Comparison of the
Genes Encoding the Polyketide Synthases for Avermectin, Erythromycin, and Nemadectin.

MacNeil et al., 1992, Gene 115: 119-125, Complex Organization of the
Streptomyces avermitilis genes encoding the avermectin polyketide synthase.

Ikeda et al., Aug. 1999, Organization of the biosynthetic gene cluster for the
polyketide anthelmintic macrolide avermectin in Streptomyces avermitilis, Proc. Natl. Acad.
Sci. USA 96: 9509-9514.

Candicidin (FR008)

Hu et al., 1994, Mol. Microbiol. 14: 163-172.

Epothilone

U.S. Pat. App. Serial No. 60/130,560, filed 22 April 1999.

Erythromycin

PCT Pub. No. 93/13663 to Abbott.

U.S. Pat. No. 5,824,513 to Abbott.

Donadio et al., 1991, Science 252: 675-9.

Cortes et al., 8 Nov. 1990, Nature 348: 176-8, An unusually large
multifunctional polypeptide in the erythromycin producing polyketide synthase of
Saccharopolyspora erythraea.

Glycosylation Enzymes

PCT Pat. App. Pub. No. 97/23630 to Abbott.

FK-506

Motamedi et al., 1998, The biosynthetic gene cluster for the macrolactone ring of
the immunosuppressant FK-506, Eur. J. Biochem. 256: 528-534.

Motamedi et al., 1997, Structural organization of a multifunctional polyketide
synthase involved in the biosynthesis of the macrolide immunosuppressant FK-506, Eur. J.
Biochem. 244: 74-80.

Methyltransferase

U.S. Pat. No. 5,264,355, issued 23 Nov. 1993, Methylating enzyme from
Streptomyces MA6858. 31-O-desmethyl-FK-506 methyltransferase.

Motamedi et al., 1996, Characterization of methyltransferase and
hydroxylase genes involved in the biosynthesis of the immunosuppressants FK-506 and
FK-520, J. Bacteriol. 178: 5243-5248.

Streptomyces hygroscopicus

U.S. patent application Serial No. 09/154,083, filed 16 Sep. 1998.

Lovastatin

U.S. Pat. No. 5,744,350 to Merck.

Narbomycin

U.S. patent application Serial No. 60/107,093, filed 5 Nov. 1998, and Serial No.
60/120,254, filed 16 Feb. 1999.

Nemadectin

MacNeil et al., 1993, supra.

Niddamycin

Kakavas et al., 1997, Identification and characterization of the niddamycin
polyketide synthase genes from Streptomyces caelestis, J. Bacteriol. 179: 7515-7522.

Oleandomycin

Swan et al., 1994, Characterisation of a Streptomyces antibioticus gene encoding a
type I polyketide synthase which has an unusual coding sequence, Mol. Gen. Genet. 242:
358-362.

U.S. patent application Serial No. 60/120,254, filed 16 Feb. 1999.

Olano et al., 1998, Analysis of a Streptomyces antibioticus chromosomal region
involved in oleandomycin biosynthesis, which encodes two glycosyltransferases responsible
for glycosylation of the macrolactone ring, Mol. Gen. Genet. 259(3): 299-308.

Picromycin

PCT patent application US99/15047, filed 2 Jul. 1999.

Xue et al., 1998, Hydroxylation of macrolactones YC-17 and narbomycin is
mediated by the pikC-encoded cytochrome P450 in Streptomyces venezuelae, Chemistry &
Biology 5(11): 661-667.

Xue et al., Oct. 1998, A gene cluster for macrolide antibiotic biosynthesis in
Streptomyces venezuelae: Architecture of metabolic diversity, Proc. Natl. Acad. Sci. USA
95: 12111-12116.

Platenolide

EP Pat. App. Pub. No. 791,656 to Lilly.

Rapamycin

Schwecke et al., Aug. 1995, The biosynthetic gene cluster for the
polyketide rapamycin, Proc. Natl. Acad. Sci. USA 92: 7839-7843.

Aparicio et al., 1996, Organization of the biosynthetic gene cluster for rapamycin in
Streptomyces hygroscopicus: analysis of the enzymatic domains in the modular polyketide
synthase, Gene 169: 9-16.

Rifamycin

August et al., 13 Feb. 1998, Biosynthesis of the ansamycin antibiotic rifamycin:
deductions from the molecular analysis of the rif biosynthetic gene cluster of Amycolatopsis
mediterranei S699, Chemistry & Biology 5(2): 69-79.

Sorangium PKS

U.S. patent application Serial No. 09/144,055, filed 31 Aug. 1998.

Soraphen

U.S. Pat. No. 5,716,849 to Novartis.

Schupp et al., 1995, J. Bacteriology 177: 3673-3679, A Sorangium cellulosum
(Myxobacterium) Gene Cluster for the Biosynthesis of the Macrolide Antibiotic Soraphen
A: Cloning, Characterization, and Homology to Polyketide Synthase Genes from
Actinomycetes.

Spiramycin

U.S. Pat. No. 5,098,837 to Lilly.

Activator Gene

U.S. Pat. No. 5,514,544 to Lilly.

Tylosin

EP Pub. No. 791,655 to Lilly.

U.S. Pat. No. 5,876,991 to Lilly.

Kuhstoss et al., 1996, Gene 183: 231-6, Production of a novel polyketide through
the construction of a hybrid polyketide synthase.

Tailoring enzymes

Merson-Davies and Cundliffe, 1994, Mol. Microbiol. 13: 349-355, Analysis of five
tylosin biosynthetic genes from the tylBA region of the Streptomyces fradiae genome.

As the above Table illustrates, there are a wide variety of polyketide synthase genes
that serve as readily available sources of DNA and sequence information for use in
constructing the hybrid PKS-encoding DNA compounds of the invention. Methods for
constructing hybrid PKS-encoding DNA compounds are described without reference to the
FK-520 PKS in PCT patent publication No. 98/51695; U.S. Patent Nos. 5,672,491 and
5,712,146; and U.S. patent application Serial Nos. 09/073,538, filed 6 May 1998, and
09/141,908, filed 28 Aug. 1998, each of which is incorporated herein by reference.

The hybrid PKS-encoding DNA compounds of the invention can be and often are
hybrids of more than two PKS genes. Moreover, there are often two or more modules in the
hybrid PKS in which all or part of the module is derived from a second (or third) PKS.
Thus, as one illustrative example, the present invention provides a hybrid FK-520 PKS that
contains the naturally occurring loading module and FkbP as well as modules one, two,
four, six, seven, eight, nine, and ten of the FK-520 PKS and further contains hybrid or
heterologous modules three and five. Hybrid or heterologous module three contains an AT
domain that is specific for methylmalonyl CoA and can be derived, for example, from the
erythromycin or rapamycin PKS genes. Hybrid or heterologous module five contains an AT
domain that is specific for malonyl CoA and can be derived, for example, from the
picromycin or rapamycin PKS genes.

While an important embodiment of the present invention relates to hybrid PKS
enzymes and corresponding genes, the present invention also provides recombinant FK-520
PKS genes in which there is no second PKS gene sequence present but which differ from
the FK-520 PKS gene by one or more deletions. The deletions can encompass one or more
modules and/or can be limited to a partial deletion within one or more modules. When a
deletion encompasses an entire module, the resulting FK-520 derivative is at least two
carbons shorter than the polyketide produced by the PKS gene from which it was derived.
When a deletion is within a module, the deletion typically encompasses a KR, DH, or ER
domain, or both DH and ER domains, or both KR and DH domains, or all three KR, DH,
and ER domains.

To construct a hybrid PKS or FK-520 derivative PKS gene of the invention, one can
employ a technique, described in PCT Pub. No. 98/27203 and U.S. patent application Serial
No. 08/989,372, filed 11 Dec. 1997, each of which is incorporated herein by reference, in
which the large PKS gene is divided into two or more, typically three, segments, and each
segment is placed on a separate expression vector. In this manner, each of the segments of
the gene can be altered, and various altered segments can be combined in a single host cell
to provide a recombinant PKS gene of the invention. This technique makes more efficient
the construction of large libraries of recombinant PKS genes, vectors for expressing those
genes, and host cells comprising those vectors.
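
The efficiency gain from the segmented approach comes from combinatorics: each altered segment is built once and then paired with every variant of the other segments. The short Python sketch below illustrates the counting only; the function name combine_segments and the variant labels are hypothetical placeholders and do not correspond to any construct described herein.

```python
from itertools import product


def combine_segments(*segment_variants):
    """Return every combination that takes one variant of each gene segment."""
    return list(product(*segment_variants))


# Hypothetical variant labels for a PKS gene divided into three segments,
# each segment carried on its own expression vector.
segment_1 = ["seg1-native", "seg1-altA"]
segment_2 = ["seg2-native", "seg2-altA", "seg2-altB"]
segment_3 = ["seg3-native", "seg3-altA"]

library = combine_segments(segment_1, segment_2, segment_3)

# 2 + 3 + 2 = 7 vectors constructed, 2 * 3 * 2 = 12 recombinant PKS genes.
print(len(segment_1) + len(segment_2) + len(segment_3), "vectors")
print(len(library), "combinations")
```

In this illustration seven vectors yield twelve distinct combinations, and the ratio improves as more variants of each segment are added.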

Thus, in one important embodiment, the recombinant DNA compounds of the
invention are expression vectors. As used herein, the term expression vector refers to any
nucleic acid that can be introduced into a host cell or cell-free transcription and translation
medium. An expression vector can be maintained stably or transiently in a cell, whether as
part of the chromosomal or other DNA in the cell or in any cellular compartment, such as a
replicating vector in the cytoplasm. An expression vector also comprises a gene that serves
to produce RNA that is translated into a polypeptide in the cell or cell extract. Furthermore,
expression vectors typically contain additional functional elements, such as
resistance-conferring genes to act as selectable markers.

The various components of an expression vector can vary widely, depending on the
intended use of the vector. In particular, the components depend on the host cell(s) in which
the vector will be used or is intended to function. Vector components for expression and
maintenance of vectors in E. coli are widely known and commercially available, as are
vector components for other commonly used organisms, such as yeast cells and
Streptomyces cells.

In a preferred embodiment, the expression vectors of the invention are used to
construct recombinant Streptomyces host cells that express a recombinant PKS of the
invention. Preferred Streptomyces host cell/vector combinations of the invention include S.
coelicolor CH999 and S. lividans K4-114 host cells, which do not produce actinorhodin,
and expression vectors derived from the pRM1 and pRM5 vectors, as described in U.S.
Patent No. 5,830,750 and U.S. patent application Serial Nos. 08/828,898, filed 31 Mar.
1997, and 09/181,833, filed 28 Oct. 1998, each of which is incorporated herein by
reference.

The present invention provides a wide variety of expression vectors for use in
Streptomyces. For replicating vectors, the origin of replication can be, for example and
without limitation, from a low copy number vector, such as SCP2* (see Hopwood et al., Genetic
Manipulation of Streptomyces: A Laboratory Manual (The John Innes Foundation,
Norwich, U.K., 1985); Lydiate et al., 1985, Gene 35: 223-235; and Kieser and Melton,
1988, Gene 65: 83-91, each of which is incorporated herein by reference), SLP1.2
(Thompson et al., 1982, Gene 20: 51-62, incorporated herein by reference), and SG5(ts)
(Muth et al., 1989, Mol. Gen. Genet. 219: 341-348, and Bierman et al., 1992, Gene 116:
43-49, each of which is incorporated herein by reference), or a high copy number vector, such
as pIJ101 and pJV1 (see Katz et al., 1983, J. Gen. Microbiol. 129: 2703-2714; Vara et al.,
1989, J. Bacteriol. 171: 5872-5881; and Servin-Gonzalez, 1993, Plasmid 30: 131-140, each
of which is incorporated herein by reference). Generally, however, high copy number
vectors are not preferred for expression of genes contained on large segments of DNA. For
non-replicating and integrating vectors, it is useful to include at least an E. coli origin of
replication, such as from pUC, p1P, p1I, and pBR. For phage based vectors, the phages
phiC31 and KC515 can be employed (see Hopwood et al., supra).

Typically, the expression vector will comprise one or more marker genes by which
host cells containing the vector can be identified and/or selected. Useful antibiotic resistance
conferring genes for use in Streptomyces host cells include the ermE (confers resistance to
erythromycin and other macrolides and lincomycin), tsr (confers resistance to thiostrepton),
aadA (confers resistance to spectinomycin and streptomycin), aacC4 (confers resistance to
apramycin, kanamycin, gentamicin, geneticin (G418), and neomycin), hyg (confers
resistance to hygromycin), and vph (confers resistance to viomycin) resistance conferring
genes.
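
When planning a selection scheme, the marker list above can be treated as a lookup table from resistance gene to antibiotic. The Python sketch below simply transcribes that list; the names RESISTANCE_MARKERS and markers_for are hypothetical and are introduced here only for illustration.

```python
# Resistance-conferring genes and the antibiotics they protect against,
# transcribed from the list in the text.
RESISTANCE_MARKERS = {
    "ermE": ["erythromycin", "other macrolides", "lincomycin"],
    "tsr": ["thiostrepton"],
    "aadA": ["spectinomycin", "streptomycin"],
    "aacC4": ["apramycin", "kanamycin", "gentamicin", "geneticin (G418)", "neomycin"],
    "hyg": ["hygromycin"],
    "vph": ["viomycin"],
}


def markers_for(antibiotic):
    """Return the marker genes that confer resistance to `antibiotic`."""
    return sorted(
        gene for gene, drugs in RESISTANCE_MARKERS.items() if antibiotic in drugs
    )


print(markers_for("thiostrepton"))
print(markers_for("apramycin"))
```

An empty result indicates that none of the listed markers is suitable for selection with the queried antibiotic.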

The recombinant PKS gene on the vector will be under the control of a promoter,
typically with an attendant ribosome binding site sequence. The present invention provides
the endogenous promoters of the FK-520 PKS and related biosynthetic genes in
recombinant form, and these promoters are preferred for use in the native hosts and in
heterologous hosts in which the promoters function. A preferred promoter of the invention
is the fkbO gene promoter, comprised in a sequence of about 270 bp between the start of the
open reading frames of the fkbO and fkbB genes. The fkbO promoter is believed to be
bi-directional in that it promotes transcription of the genes fkbO, fkbP, and fkbA in one
direction and fkbB, fkbC, and fkbL in the other. Thus, in one aspect, the present invention
provides a recombinant expression vector comprising the promoter of the fkbO gene of an
FK-520 producing organism positioned to transcribe a gene other than fkbO. In a preferred
embodiment, the transcribed gene is an FK-520 PKS gene. In another preferred embodiment,
the transcribed gene is a gene that encodes a protein comprised in a hybrid PKS.

Heterologous promoters can also be employed and are preferred for use in host cells
in which the endogenous FK-520 PKS gene promoters do not function or function poorly. A
preferred heterologous promoter is the actI promoter and its attendant activator gene
actII-ORF4, which is provided in the pRM1 and pRM5 expression vectors, supra. This promoter
is activated in the stationary phase of growth when secondary metabolites are normally
synthesized. Other useful Streptomyces promoters include without limitation those from the
ermE gene and the melC1 gene, which act constitutively, and the tipA gene and the merA
gene, which can be induced at any growth stage. In addition, the T7 RNA polymerase
system has been transferred to Streptomyces and can be employed in the vectors and host
cells of the invention. In this system, the coding sequence for the T7 RNA polymerase is
inserted into a neutral site of the chromosome or in a vector under the control of the
inducible merA promoter, and the gene of interest is placed under the control of the T7
promoter. As noted above, one or more activator genes can also be employed to enhance the
activity of a promoter. Activator genes in addition to the actII-ORF4 gene discussed above
include dnrI, redD, and ptpA genes (see U.S. patent application Serial No. 09/181,833,
supra) to activate promoters under their control.

In addition to providing recombinant DNA compounds that encode the FK-520
PKS, the present invention also provides DNA compounds that encode the enzymes that
produce the ethylmalonyl CoA and 2-hydroxymalonyl CoA utilized in the synthesis of FK-520.
Thus, the present invention also provides recombinant host cells that express the genes
required for the biosynthesis of ethylmalonyl CoA and 2-hydroxymalonyl CoA. Figures 3 and 4
show the location of these genes on the cosmids of the invention and the biosynthetic pathway
that produces ethylmalonyl CoA.

For 2-hydroxymalonyl CoA biosynthesis, the fkbH, fkbI, fkbJ, and fkbK genes are
sufficient to confer this ability on Streptomyces host cells. For conversion of
2-hydroxymalonyl to 2-methoxymalonyl, the fkbG gene is also employed. While the complete
coding sequence for fkbH is provided on the cosmids of the invention, the sequence for this
gene provided herein may be missing a T residue, based on a comparison made with a
similar gene cloned from the ansamitocin gene cluster by Dr. H. Floss. Where the sequence
herein shows one T, there may be two, resulting in an extension of the fkbH reading frame
to encode the amino acid sequence:

NlTI\'KCLAA\mDNTL\VRGT\TEDDEV\"LTDhlRE\"iTTLDDRGILOAVASKNDHD 
LA\VTRLERLG\\AEVI-ATARIG\VGPK:SOS\'REIATr:LNTAPTTIAFIDDQPAER-\EVA 
5 FHLPEVRC\'PAEO^TLLSLPEFSPPVST\'DSRRRRLMVOAGF.ARDQAREAYSGPD 
EDFLRSLDLSMTIAPAGEEELSRVEELTLRTSQ\lNATG\'FiYSDADLRALLTDPAHE 
\T\ATMGDRFGPHGAVGIILLEKKPST^ 

ARAGAJ-ILVADFRRTDRmMMEIAYRFAGFADSDCPCVSEVAGAS-AAGVERLHLEP 
SARP/VPTTLTLT.A.ADIAPVTVSAAG. 
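
The effect of a single missing T on a reading frame can be illustrated with a toy example. The Python sketch below uses the standard genetic code and a short invented DNA string; it is not the fkbH sequence, and the names translate, one_t, and two_t are hypothetical. It shows only the general principle that duplicating one nucleotide shifts the downstream frame and can remove an in-frame stop codon, thereby extending the encoded protein.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate from position 0 until the first stop codon or the end."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Toy sequence (invented for illustration): the third codon is the stop TGA.
one_t = "ATGGCATGAGGCCAAGTTAAC"
# Same sequence with the T at index 6 duplicated, shifting the frame.
two_t = one_t[:6] + "T" + one_t[6:]

print(translate(one_t))  # short product ending at the early stop
print(translate(two_t))  # longer product read in the shifted frame
```

With the single T the toy frame terminates after two residues; with two Ts the stop codon is no longer in frame and translation continues to a later stop.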

For ethylmalonyl CoA biosynthesis, one requires only a crotonyl CoA reductase,
which can be supplied by the host cell but can also be supplied by recombinant expression
of the fkbS gene of the present invention. To increase yield of ethylmalonyl CoA, one can
also express the fkbE and fkbU genes as well. While such production can be achieved using
only the recombinant genes above, one can also achieve such production by placing into the
recombinant host cell a large segment of the DNA provided by the cosmids of the invention.
Thus, for 2-hydroxymalonyl and 2-methoxymalonyl CoA biosynthesis, one can simply
provide the cells with the segment of DNA located on the left side of the FK-520 PKS genes
shown in Figure 1. For ethylmalonyl CoA biosynthesis, one can simply provide the cells
with the segment of DNA located on the right side of the FK-520 PKS genes shown in
Figure 1 or, alternatively, both the right and left segments of DNA.

The recombinant DNA expression vectors that encode these genes can be used to
construct recombinant host cells that can make these important polyketide building blocks
from cells that otherwise are unable to produce them. For example, Streptomyces coelicolor
and Streptomyces lividans do not synthesize ethylmalonyl CoA or 2-hydroxymalonyl CoA.
The invention provides methods and vectors for constructing recombinant Streptomyces
coelicolor and Streptomyces lividans that are able to synthesize either or both ethylmalonyl
CoA and 2-hydroxymalonyl CoA. These host cells are thus able to make polyketides that
require these substrates and that cannot otherwise be made in such cells.

In a preferred embodiment, the present invention provides recombinant
Streptomyces host cells, such as S. coelicolor and S. lividans, that have been transformed
with a recombinant vector of the invention that codes for the expression of the ethylmalonyl
CoA biosynthetic genes. The resulting host cells produce ethylmalonyl CoA and so are
preferred host cells for the production of polyketides produced by PKS enzymes that
comprise one or more AT domains specific for ethylmalonyl CoA. Illustrative PKS
enzymes of this type include the FK-520 PKS and a recombinant PKS in which one or more
AT domains is specific for ethylmalonyl CoA.

In a related embodiment, the present invention provides Streptomyces host cells in
which one or more of the ethylmalonyl or 2-hydroxymalonyl biosynthetic genes have been
deleted by homologous recombination or rendered inactive by mutation. For example,
deletion or inactivation of the fkbG gene can prevent formation of the methoxyl groups at
C-13 and C-15 of FK-520 (or, in the corresponding FK-506 producing cell, FK-506), leading
to the production of 13,15-didesmethoxy-13,15-dihydroxy-FK-520 (or, in the
corresponding FK-506 producing cell, 13,15-didesmethoxy-13,15-dihydroxy-FK-506). If
the fkbG gene product acts on 2-hydroxymalonyl and the resulting 2-methoxymalonyl
substrate is required for incorporation by the PKS, the AT domains of modules 7 and 8 may
bind malonyl CoA and methylmalonyl CoA. Such incorporation results in the production of
a mixture of polyketides in which the methoxy groups at C-13 and C-15 of FK-520 (or
FK-506) are replaced by either hydrogen or methyl.

This possibility of non-specific binding is suggested by the construction of a hybrid
PKS of the invention in which the AT domain of module 8 of the FK-520 PKS replaced the
AT domain of module 6 of DEBS. The resulting PKS produced, in Streptomyces lividans,
6-dEB and 2-desmethyl-6-dEB, indicating that the AT domain of module 8 of the FK-520
PKS could bind malonyl CoA and methylmalonyl CoA substrates. Thus, one could possibly
also prepare the 13,15-didesmethoxy-FK-520 and corresponding FK-506 compounds of the
invention by deleting or otherwise inactivating one or more or all of the genes required for
2-hydroxymalonyl CoA biosynthesis, i.e., the fkbH, fkbI, fkbJ, and fkbK genes. In any
event, the deletion or inactivation of one or more biosynthetic genes required for
ethylmalonyl and/or 2-hydroxymalonyl production prevents the formation of polyketides
requiring ethylmalonyl and/or 2-hydroxymalonyl for biosynthesis, and the resulting host
cells are thus preferred for production of polyketides that do not require the same.

The host cells of the invention can be grown and fermented under conditions known
in the art for other purposes to produce the compounds of the invention. See, e.g., U.S.
Patent Nos. 5,194,378; 5,116,756; and 5,494,820, incorporated herein by reference, for
suitable fermentation processes. The compounds of the invention can be isolated from the
fermentation broths of these cultured cells and purified by standard procedures. Preferred
compounds of the invention include the following compounds: 13-desmethoxy-FK-506;
13-desmethoxy-FK-520; 13,15-didesmethoxy-FK-506; 13,15-didesmethoxy-FK-520;
13-desmethoxy-18-hydroxy-FK-506; 13-desmethoxy-18-hydroxy-FK-520;
13,15-didesmethoxy-18-hydroxy-FK-506; and 13,15-didesmethoxy-18-hydroxy-FK-520. These
compounds can be further modified as described for tacrolimus and FK-520 in U.S. Patent
Nos. 5,225,403; 5,189,042; 5,164,495; 5,068,323; 4,980,466; and 4,920,218, incorporated
herein by reference.

Other compounds of the invention are shown in Figure 8, Parts A and B. In Figure 8,
Part A, illustrative C-32-substituted compounds of the invention are shown in two columns
under the heading R. The substituted compounds are preferred for topical administration
and are applied to the dermis for treatment of conditions such as psoriasis. In Figure 8, Part
B, illustrative reaction schemes for making the compounds shown in Figure 8, Part A, are
provided. In the upper scheme in Figure 8, Part B, the C-32 substitution is a tetrazole
moiety, illustrative of the groups shown in the left column under R in Figure 8, Part A. In
the lower scheme in Figure 8, Part B, the C-32 substitution is a disubstituted amino group,
where R3 and R4 can be any group similar to the illustrative groups shown attached to the
amine in the right column under R in Figure 8, Part A. While Figure 8 shows the
C-32-substituted compounds in which the C-15-methoxy is present, the invention includes these
C-32-substituted compounds in which C-15 is ethyl, methyl, or hydrogen. Also, while C-21
is shown as substituted with ethyl or allyl, the compounds of the invention include the
C-32-substituted compounds in which C-21 is substituted with hydrogen or methyl.

To make these C-32-substituted compounds, Figure 8, Part B, provides illustrative
reaction schemes. Thus, a selective reaction of the starting compound (see Figure 8, Part B,
for an illustrative starting compound) with trifluoromethanesulfonic anhydride in the
presence of a base yields the C-32 O-triflate derivative, as shown in the upper scheme of
Figure 8, Part B. Displacement of the triflate with 1H-tetrazole or triazole derivatives
provides the C-32 tetrazole or triazole derivative. As shown in the lower scheme of Figure
8, Part B, reacting the starting compound with p-nitrophenylchloroformate yields the
corresponding carbonate, which, upon displacement with an amino compound, provides the
corresponding carbamate derivative.

The compounds can be readily formulated to provide the pharmaceutical
compositions of the invention. The pharmaceutical compositions of the invention can be
used in the form of a pharmaceutical preparation, for example, in solid, semisolid, or liquid
form. This preparation contains one or more of the compounds of the invention as an active
ingredient in admixture with an organic or inorganic carrier or excipient suitable for
external, enteral, or parenteral application. The active ingredient may be compounded, for
example, with the usual non-toxic, pharmaceutically acceptable carriers for tablets, pellets,
capsules, suppositories, solutions, emulsions, suspensions, and any other form suitable for
use. Suitable formulation processes and compositions for the compounds of the present
invention are described with respect to tacrolimus in U.S. Patent Nos. 5,939,427; 5,922,729;
5,385,907; 5,338,684; and 5,260,301, incorporated herein by reference. Many of the
compounds of the invention contain one or more chiral centers, and all of the stereoisomers
are included within the scope of the invention, as pure compounds as well as mixtures of
stereoisomers. Thus the compounds of the invention may be supplied as a mixture of
stereoisomers in any proportion.

The carriers which can be used include water, glucose, lactose, gum acacia, gelatin,
mannitol, starch paste, magnesium trisilicate, talc, corn starch, keratin, colloidal silica,
potato starch, urea, and other carriers suitable for use in manufacturing preparations, in
solid, semi-solid, or liquified form. In addition, auxiliary stabilizing, thickening, and
coloring agents and perfumes may be used. For example, the compounds of the invention
may be utilized with hydroxypropyl methylcellulose essentially as described in U.S. Patent
No. 4,916,138, incorporated herein by reference, or with a surfactant essentially as
described in EPO patent publication No. 428,169, incorporated herein by reference.

Oral dosage forms may be prepared essentially as described by Hondo et al., 1987,
Transplantation Proceedings XIX, Supp. 6: 17-22, incorporated herein by reference. Dosage
forms for external application may be prepared essentially as described in EPO patent
publication No. 423,714, incorporated herein by reference. The active compound is included
in the pharmaceutical composition in an amount sufficient to produce the desired effect
upon the disease process or condition.

For the treatment of conditions and diseases relating to immunosuppression or 
neuronal damage, a compound of the invention may be administered orally, topically, 
parenterally, by inhalation spray, or rectally in dosage unit formulations containing
conventional non-toxic pharmaceutically acceptable carriers, adjuvants, and vehicles. The
term parenteral, as used herein, includes subcutaneous injections, and intravenous, 
intramuscular, and intrasternal injection or infusion techniques. 

Dosage levels of the compounds of the present invention are of the order from about
0.01 mg to about 50 mg per kilogram of body weight per day, preferably from about 0.1 mg
to about 10 mg per kilogram of body weight per day. The dosage levels are useful in the
treatment of the above-indicated conditions (from about 0.7 mg to about 3.5 g per patient
per day, assuming a 70 kg patient). In addition, the compounds of the present invention may
be administered on an intermittent basis, i.e., at semi-weekly, weekly, semi-monthly, or
monthly intervals.
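
The per-patient figures follow from multiplying the per-kilogram range by the assumed body weight. A minimal sketch of that arithmetic, in which the function name and the 70 kg default are illustrative assumptions and not part of the specification:

```python
def daily_dose_range_mg(low_mg_per_kg, high_mg_per_kg, body_weight_kg=70.0):
    """Return (low, high) total daily dose in mg for one patient.

    Hypothetical helper for illustration only; it scales a mg/kg/day
    range by body weight and does nothing else.
    """
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Broad range stated in the text: about 0.01 to about 50 mg/kg/day.
broad_low_mg, broad_high_mg = daily_dose_range_mg(0.01, 50.0)

# Preferred range stated in the text: about 0.1 to about 10 mg/kg/day.
preferred_low_mg, preferred_high_mg = daily_dose_range_mg(0.1, 10.0)

print(f"broad: {broad_low_mg:.1f} mg to {broad_high_mg / 1000:.1f} g per day")
print(f"preferred: {preferred_low_mg:.0f} mg to {preferred_high_mg:.0f} mg per day")
```

For a 70 kg patient the upper bound of the broad range works out to 3500 mg, i.e. 3.5 g per day.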

The amount of active ingredient that may be combined with the carrier materials to
produce a single dosage form will vary depending upon the host treated and the particular
mode of administration. For example, a formulation intended for oral administration to
humans may contain from 0.5 mg to 5 g of active agent compounded with an appropriate
and convenient amount of carrier material, which may vary from about 5 percent to about
95 percent of the total composition. Dosage unit forms will generally contain from about 0.5
mg to about 500 mg of active ingredient. For external administration, the compounds of the
invention can be formulated within the range of, for example, 0.00001% to 60% by weight,
preferably from 0.001% to 10% by weight, and most preferably from about 0.005% to 0.8%
by weight. The compounds and compositions of the invention are useful in treating disease
conditions using doses and administration schedules as described for tacrolimus in U.S.
Patent Nos. 5,542,436; 5,365,948; 5,348,966; and 5,196,437, incorporated herein by
reference. The compounds of the invention can be used as single therapeutic agents or in
combination with other therapeutic agents. Drugs that can be usefully combined with
compounds of the invention include one or more immunosuppressant agents such as
rapamycin, cyclosporin A, FK-506, or one or more neurotrophic agents.

It will be understood, however, that the specific dosage level for any particular
patient will depend on a variety of factors. These factors include the activity of the specific
compound employed; the age, body weight, general health, sex, and diet of the subject; the
time and route of administration and the rate of excretion of the drug; whether a drug
combination is employed in the treatment; and the severity of the particular disease or
condition for which therapy is sought.

A detailed description of the invention having been provided above, the following
examples are given for the purpose of illustrating the present invention and shall not be
construed as being a limitation on the scope of the invention or claims.

Example 1 

Replacement of Methoxyl with Hydrogen or Methyl at C-13 of FK-520 
The C-13 methoxyl group is introduced into FK-520 via an AT domain in extender
module 8 of the PKS that is specific for hydroxymalonyl and by methylation of the
hydroxyl group by an S-adenosyl methionine (SAM) dependent methyltransferase.
Metabolism of FK-506 and FK-520 primarily involves oxidation at the C-13 position into
an inactive derivative that is further degraded by host P450 and other enzymes. The present
invention provides compounds related in structure to FK-506 and FK-520 that do not
contain the C-13 methoxy group and exhibit greater stability and a longer half-life in vivo.
These compounds are useful medicaments due to their immunosuppressive and
neurotrophic activities, and the invention provides the compounds in purified form and as
pharmaceutical compositions.

The present invention also provides the novel PKS enzymes that produce these
novel compounds as well as the expression vectors and host cells that produce the novel
PKS enzymes. The novel PKS enzymes include, among others, those that contain an AT
domain specific for either malonyl CoA or methylmalonyl CoA in module 8 of the FK-506
and FK-520 PKS. This example describes the construction of recombinant DNA
compounds that encode the novel FK-520 PKS enzymes and the transformation of host cells
with those recombinant DNA compounds to produce the novel PKS enzymes and the
polyketides produced thereby.

To construct an expression cassette for performing module 8 AT domain
replacements in the FK-520 PKS, a 4.6 kb SphI fragment from the FK-520 gene cluster was
cloned into plasmid pLitmus 38 (a cloning vector available from New England Biolabs).
The 4.6 kb SphI fragment, which encodes the ACP domain of module 7 followed by module
8 through the KR domain, was isolated from an agarose gel after digesting the cosmid
pKOS65-C31 with SphI. The clone having the insert oriented so the single SacI site was
nearest to the SpeI end of the polylinker was identified and designated as plasmid
pKOS60-21-67. To generate appropriate cloning sites, two linkers were ligated sequentially as
follows. First, a linker was ligated between the SpeI and SacI sites to introduce a BglII site
at the 5' end of the cassette, to eliminate interfering polylinker sites, and to reduce the total
insert size to 4.5 kb (the limit of the phage KC515). The ligation reactions contained 5
picomolar unphosphorylated linker DNA and 0.1 picomolar vector DNA, i.e., a 50-fold
molar excess of linker to vector. The linker had the following sequence:

5'-CTAGTGGGCAGATCTGGCAGCT-3'
3'-ACCCGTCTAGACCG-5'

The resulting plasmid was designated pKOS60-27-1.
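
The design of the first linker can be checked directly from its two strands. The following is a minimal sketch, assuming the top strand reads 5'-CTAGTGGGCAGATCTGGCAGCT-3' and the bottom strand reads 3'-ACCCGTCTAGACCG-5'. The helper names are hypothetical. The sketch confirms that the strands are complementary over the duplex region, that the duplex carries a BglII site (AGATCT), that the single-stranded ends match SpeI (5'-CTAG) and SacI (AGCT-3') overhangs, and that 5 parts linker to 0.1 part vector is a 50-fold molar excess.

```python
# Linker strands as read from the text (an assumption where the scan is unclear).
TOP_5TO3 = "CTAGTGGGCAGATCTGGCAGCT"   # top strand, written 5'->3'
BOTTOM_3TO5 = "ACCCGTCTAGACCG"         # bottom strand, written 3'->5'

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement, without reversing the sequence."""
    return seq.translate(_COMPLEMENT)


def split_linker(top, bottom_3to5, left_overhang_len):
    """Split the top strand into left overhang, duplex region, right overhang."""
    start = left_overhang_len
    end = start + len(bottom_3to5)
    return top[:start], top[start:end], top[end:]


left, duplex, right = split_linker(TOP_5TO3, BOTTOM_3TO5, 4)

strands_pair = complement(duplex) == BOTTOM_3TO5   # duplex region base-pairs
has_bglii_site = "AGATCT" in duplex                 # BglII recognition site
molar_excess = 5.0 / 0.1                            # linker : vector

print(left, duplex, right, strands_pair, has_bglii_site, molar_excess)
```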
Next, a linker of the following sequence was ligated between the unique SphI and
AflII sites of plasmid pKOS60-27-1 to introduce an NsiI site at the 3' end of the module 8
cassette. The linker employed was:

5'-GGGATGCATGGC-3'
3'-GTACCCCTACGTACCGAATT-5'

The resulting plasmid was designated pKOS60-29-55.

To allow in-frame insertions of alternative AT domains, sites were engineered at the
5' end (AvrII or NheI) and 3' end (XhoI) of the AT domain using the polymerase chain
reaction (PCR) as follows. Plasmid pKOS60-29-55 was used as a template for the PCR and
sequence 5' to the AT domain was amplified with the primers SpeBgl-fwd and either
Avr-rev or Nhe-rev:

SpeBgl-fwd 5'-CGACTCACTAGTGGGCAGATCTGG-3'
Avr-rev 5'-CACGCCTAGGCCGGTCGGTCTCGGGCCAC-3'
Nhe-rev 5'-GCGGCTAGCTGCTCGCCCATCGCGGGATGC-3'

The PCR included, in a 50 µl reaction, 5 µl of 10x Pfu polymerase buffer
(Stratagene), 5 µl 10x z-dNTP mixture (2 mM dATP, 2 mM dCTP, 2 mM dTTP, 1 mM
dGTP, 1 mM 7-deaza-GTP), 5 µl DMSO, 2 µl of each primer (10 µM), 1 µl of template
DNA (0.1 µg/µl), and 1 µl of cloned Pfu polymerase (Stratagene). The PCR conditions
were 95°C for 2 min., 25 cycles at 95°C for 30 sec., 60°C for 30 sec., and 72°C for 4 min.,
followed by 4 min. at 72°C and a hold at 0°C. The amplified DNA products and the Litmus
vectors were cut with the appropriate restriction enzymes (BglII and AvrII or SpeI and
NheI), and cloned into either pLitmus 28 or pLitmus 38 (New England Biolabs),
respectively, to generate the constructs designated pKOS60-37-4 and pKOS60-37-2,
respectively.
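
The listed reaction components account for only part of the 50 µl volume. A minimal sketch of the bookkeeping, assuming the balance is made up with water (the text does not state this) and using hypothetical variable names:

```python
TOTAL_VOLUME_UL = 50.0

# Volumes in microlitres, as listed for the 50 ul reaction.
components_ul = {
    "10x Pfu polymerase buffer": 5.0,
    "10x z-dNTP mixture": 5.0,
    "DMSO": 5.0,
    "forward primer (10 uM)": 2.0,
    "reverse primer (10 uM)": 2.0,
    "template DNA (0.1 ug/ul)": 1.0,
    "cloned Pfu polymerase": 1.0,
}

listed_volume_ul = sum(components_ul.values())

# Assumption: the remainder of the reaction is water.
water_ul = TOTAL_VOLUME_UL - listed_volume_ul


def final_conc(stock_conc, volume_ul, total_ul=TOTAL_VOLUME_UL):
    """Final concentration after dilution, in the same units as the stock."""
    return stock_conc * volume_ul / total_ul


primer_final_um = final_conc(10.0, 2.0)          # each primer, uM
datp_final_mm = final_conc(2.0, 5.0)             # dATP, dCTP, dTTP, mM
dgtp_final_mm = final_conc(1.0, 5.0)             # dGTP and 7-deaza-GTP, mM
dmso_percent = 100.0 * 5.0 / TOTAL_VOLUME_UL     # DMSO, percent v/v
template_ug = 0.1 * 1.0                          # total template, ug

print(water_ul, primer_final_um, datp_final_mm, dgtp_final_mm, dmso_percent)
```

The 10x buffer and 10x z-dNTP mixture are each used at one tenth of the reaction volume, consistent with their stated concentration factor.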

Plasmid pKOS60-29-55 was again used as a template for PCR to amplify sequence
3' to the AT domain using the primers BsrXho-fwd and NsiAfl-rev:

BsrXho-fwd 5'-GATGTACAGCTCGAGTCGGCACGCCCGGCCGCATC-3'
NsiAfl-rev 5'-CGACTCACTTAAGCCATGCATCC-3'

PCR conditions were as described above. The PCR fragment was cut with BsrGI
and AflII, gel isolated, and ligated into pKOS60-37-4 cut with Asp718 and AflII and inserted
into pKOS60-37-2 cut with BsrGI and AflII to give the plasmids pKOS60-39-1 and
pKOS60-39-13, respectively. These two plasmids can be digested with AvrII and XhoI or
NheI and XhoI, respectively, to insert heterologous AT domains specific for malonyl,
methylmalonyl, ethylmalonyl, or other extender units.
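
Each cassette primer is named for the restriction sites it carries, which can be confirmed by scanning the primer for the corresponding recognition sequences. A minimal sketch, assuming the primer sequences read as 5'-CGACTCACTAGTGGGCAGATCTGG-3' (SpeBgl-fwd), 5'-CACGCCTAGGCCGGTCGGTCTCGGGCCAC-3' (Avr-rev), 5'-GCGGCTAGCTGCTCGCCCATCGCGGGATGC-3' (Nhe-rev), 5'-GATGTACAGCTCGAGTCGGCACGCCCGGCCGCATC-3' (BsrXho-fwd), and 5'-CGACTCACTTAAGCCATGCATCC-3' (NsiAfl-rev). The function name is hypothetical. All of the sites checked are palindromic, so scanning one strand is sufficient.

```python
# Recognition sequences for the enzymes named in this example.
RECOGNITION_SITES = {
    "SpeI": "ACTAGT",
    "BglII": "AGATCT",
    "AvrII": "CCTAGG",
    "NheI": "GCTAGC",
    "BsrGI": "TGTACA",
    "XhoI": "CTCGAG",
    "AflII": "CTTAAG",
    "NsiI": "ATGCAT",
}

# Primer sequences as read from the text (an assumption where the scan is unclear).
PRIMERS = {
    "SpeBgl-fwd": "CGACTCACTAGTGGGCAGATCTGG",
    "Avr-rev": "CACGCCTAGGCCGGTCGGTCTCGGGCCAC",
    "Nhe-rev": "GCGGCTAGCTGCTCGCCCATCGCGGGATGC",
    "BsrXho-fwd": "GATGTACAGCTCGAGTCGGCACGCCCGGCCGCATC",
    "NsiAfl-rev": "CGACTCACTTAAGCCATGCATCC",
}


def find_sites(sequence):
    """Return the sorted names of enzymes whose site occurs in the sequence."""
    return sorted(name for name, site in RECOGNITION_SITES.items()
                  if site in sequence)


sites_by_primer = {name: find_sites(seq) for name, seq in PRIMERS.items()}

for name, sites in sites_by_primer.items():
    print(f"{name}: {', '.join(sites)}")
```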
Malonyl and methylmalonyl-specific AT domains were cloned from the rapamycin
cluster using PCR amplification with a pair of primers that introduce an AvrII or NheI site at
the 5' end and an XhoI site at the 3' end. The PCR conditions were as given above and the
primer sequences were as follows:

RATN1 5'-ATCCTAGGCGGGCRGGYGTGTCGTCCTTCGG-3'
(3' end of Rap KS sequence and universal for malonyl and methylmalonyl CoA),
RATMN2 5'-ATGCTAGCCGCCGCGTTCCCCGTCTTCGCGCG-3'
(Rap AT shorter version 5' sequence and specific for malonyl CoA),
RATMMN2 5'-ATGCTAGCGGATTCGTCGGTGGTGTTCGCCGA-3'
(Rap AT shorter version 5' sequence and specific for methylmalonyl CoA), and
RATC 5'-ATCTCGAGCCAGTASCGCTGGTGYTGGAAGG-3'
(Rap DH 5' sequence and universal for malonyl and methylmalonyl CoA).
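
The universal primers RATN1 and RATC contain degenerate positions written with IUPAC ambiguity codes (R = A or G, Y = C or T, S = C or G), so each is a mixture of several defined oligonucleotides. A minimal sketch that enumerates the mixture, assuming the primer sequences read as 5'-ATCCTAGGCGGGCRGGYGTGTCGTCCTTCGG-3' (RATN1) and 5'-ATCTCGAGCCAGTASCGCTGGTGYTGGAAGG-3' (RATC). The function name is hypothetical.

```python
from itertools import product

# Subset of the IUPAC nucleotide codes needed for these primers.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",  # purine
    "Y": "CT",  # pyrimidine
    "S": "CG",  # strong
}

# Primer sequences as read from the text (an assumption where the scan is unclear).
RATN1 = "ATCCTAGGCGGGCRGGYGTGTCGTCCTTCGG"
RATC = "ATCTCGAGCCAGTASCGCTGGTGYTGGAAGG"


def expand_degenerate(primer):
    """Return every unambiguous sequence represented by a degenerate primer."""
    choices = [IUPAC[base] for base in primer]
    return ["".join(combo) for combo in product(*choices)]


ratn1_variants = expand_degenerate(RATN1)
ratc_variants = expand_degenerate(RATC)

print(len(ratn1_variants), len(ratc_variants))
for variant in ratn1_variants:
    print(variant)
```

Each primer has two two-fold degenerate positions, so each expands to four sequences; the engineered AvrII (CCTAGG) or XhoI (CTCGAG) site lies outside the degenerate positions and is present in every variant.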
Because of the high sequence similarity in each module of the rapamycin cluster,
each primer was expected to prime any of the AT domains. PCR products representing ATs
specific for malonyl or methylmalonyl extenders were identified by sequencing individual
cloned products. Sequencing also confirmed that the chosen clones contained no
cloning artifacts. Examples of hybrid modules with the rapamycin AT12 and AT13
domains are shown in a separate figure.

The AvrII-XhoI restriction fragment that encodes module 8 of the FK-520 PKS with
the endogenous AT domain replaced by the AT domain of module 12 of the rapamycin PKS 
has the DNA sequence and encodes the amino acid sequence shown below. The AT of rap 
module 12 is specific for incorporation of malonyl units. 

20 .^GAGCG GGCAGCTCGCCGAA.GCGCTGCTGA.CGCTCGTCCGGGAGAGCACC 5 0 
IWQLAEALLTLVREST 
GCCGCCG7GCTCGGCCACGTGGGTGGCGAGGACATCCCCGCGACGGCGGC 100 

A A V L G H V G G E D I ? A T A A 
GTTCAAGGACCTCGGCATCGACTCGCTCACCGCGG7CCAGCTGCGCAACG 150 
25 F K D L G I D S L T A V Q L R N 

G CC7CACCG A^GGCG ACCGGTGTGCGGCTG AACGCv-.'^CG'^CGG i ^ ± - C^-.-C 2 00 
A L T F A 7 G V R L N A T A V F D 
7 77 C 0 3 AG C C C G C A C G 7 G C T C G C C G G G AA.G C 7 C G 3 C G A C G AAC 7 G A 2 7 G G 2 50 
F ? 7 P F 7 F A G F 2 G 2 F F T 2 
J 0 2.- ^ CGG GGG GGGG G '2 GG TGCCCCGG ACCGCvj jCv^.Cu^' — .G'^T-o ^. o wrv... o j- L> 0 
7 P. A P 7 7 F R T A A 7 A G A H 
A2GAGCCGC7GGCGA7CG7GGGAATGGCCTGCCGGC7GCCCGGCGGGGT7 3 50 
2 F F F A I V G M A G F 2 ? G 2 V 
G GGGG A G 2 G G A G G A G C 7 G 7 G G C A C C 7 C G 7 G G C A 7 C C -o G C A GGG ~. C G 7 C T 4 0 u 
35 A3PFFLWHL7ASC7GA2 

CACGGAG7TCCCGACGGACCGCGGCTGGGACG7CGACGCGATC7ACGACC 4 50 
T F F r 7 F R G W D 7 2 A I V D 



/~* " r 1 r* 



2CGA7CGGCA„AGACCTTCGTCCGGCACGGTGGCT7CC7C 5 00 



P D ? D A I G K T F V R H G G 



T G A T G F D A A F F G I S ? R E 
GGCCC7CGCGA7GGACCCGCAGCAGCGGGTGCTCC ZGG^:GAGG\ CGZGG^ 600 

AFAHGRQQRVFFE7SW 
AGG CG T 7 C G AAA.G GZZZG G C AT C ACCC CGG AC7C GACCCGC GG GAG CG AC 55 0 
45 FA F F S A GIT? D S 7 R G 3 2 

A C C G G 0 G 7 G T T C G 7 2 2 G C G C C T T C T C C T A C G G 7 7 A G G G C A C C G G 7 G 2 G G A 7 00 



^ G7 7 G o 3 2 j G 'o A - i - GGGTGGGAo.".^ o i ^-T'.j'^ T ^ . ^ ^U'-j-C .' 3 'j 
2 G F G A 7 G S Q 7 S V L S G 
The AvrII-XhoI restriction fragment that encodes module 8 of the FK-520 PKS with
the endogenous AT domain replaced by the AT domain of module 13 (specific for
methylmalonyl CoA) of the rapamycin PKS has the DNA sequence and encodes the amino
acid sequence shown below.

35 A3A7G77GCAG;^~GC~GAA~~^~G~7GACGC^;rGCGG-7 ~~."Z»C r 5 0 

i • v. ■„Li.'-,'„ G'_ ■„-s t ,r*.CGTGCT ~ G>w GG ^wrGGGC j. ^ w oACGA^v- i '^.-.w ~> ovj ^ l 
P T r K 7 L A G K 7 G " 7 7 7 G 

40 A- ; .GCC JCGCGC 7CG7CG7GCGCCG GACCGCGGCCAGGGCCGGTGCGCACG 300 
T R A 7 V V ? ?. 7 A A T A G A H 

D E F L A I V G M A C P. L P G G V 

GGG7CACCCGAGGAGCTGTGGCAG77CGTGGCA77CGGCACCGAC7C7AT 4 00 
50 A S F E E L W H L V A S G T D A I 

t;Vuuor.G /7C ^CGA/GuGAlCCGCGG G ToGGA.CG FOo.-vCoCGAT C7 -.CoAoC 4 d <j 

T Z 7 = 7 D R G W D V D A I ':' 7 

G G G AG G G G G AC G G G A T GG G G AA GACGITCGTCCG G G AC G G T G G G 7 T C C ? C 500 

- ^ .*-. - ^.^o _u „ ^oC7 7 CGrvGG G G s 7G77 77T7GoC:i7CA > G i ^C^'^ ^'j»uA T- 3 j 

7 G A 7 G F D A A F t G I £ ? R E 
-jo'_ ^.lT^G - u." i \j G'-.Jo CnoC-.'_- 7 GG^ TGC TCC 7 'ji-jA.G o7GG 1 i^GG 6u C 
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- - • ~: — ^ ^. _ ^ ^ _ j o -i ^ _ . - . w - ^ w - ~- v_ — — - J _■ ^ _ ^ ■.; L. 

15 P 3 3 3 7 G F 5 P Q r G 1 A i : 

'_■ J" ■— ^- ^; -Z .*-«.•*. J-_5 — ^TT'^sJU'wU'wboc - — '-J'-'-" ~ ^ >~ .r. '-J ^ z:\ 0 0 

j F" A K A . 3 A 3 A 0 G . 3 r A FT, 

■3 a g v l ; v e ?. l s d a =: « :: 

— 0 . -^rv„r-. ^ w ^ . v_ _ . ^>\2 i L.G7 - v- 'J; - ^ T _ G'~> j; ^ j: . ^.".r 1 .^ J'jr. . V; . 1 J. -J 

3 H 7 7 3 A 3' V R 3 3 A 7 7 3 ~ 3 

- i ,, _ _r . " J ^. -j; 3 O - -'w w - ^ -J . O - j: j: J , - ~ _ ~ 



- ^OMj.;.... ..." „ _^ „ „ ^ ^5 _ r'.'o -J: ^. - . - j^' 1 .. J ^. ... ^ . . J.". ^ .-.'.j: ^ ^ 

3' Fl A :F 3 7 G T P. 3 3 3 3 1 3 A v 

_ : o o _ r *. G G „ . ^. ,— . ^ ^ \ r\ „ ^ o o o j ~ sj _ o _ ^ .* *. ^ „ ^ ^ 'o ^ _ w 1 _ 1 ^jo 1 3 'J 

/ G ' \j l r. . r _ _ 

^ ^ - o : w ^ j A^^T ^ ^.-G.CA- w 3'o C^.^CGC w^r^o'^C'jCo i CCGo ^ ^ . ^ v o 1 j 5'^ 
S L K S !: I G n A Q A A S G V A 
GCATCATCAA 

G I 3 K 3: V Q A L G K G hi I P P T 

— ToCA « GC*-- GAG GAG 'ZC GT dQ\Z0G GAG^r GG'oj". w 'T G ~? AGGGC GG-o ^GvCoT 1 4 d J 
35 L H A A P ? S P H 3 : D W T A G A V 

— o.^-vA 3 o GT G A,GGG GG 3G GGG G CwijV'j o ^ — G^j.-vo.'-v,- w ^ .—.G^'ow — _ G.* ^ „ I 5 0 C' 

G L L T 3 A R ? W P EG D P. ? r 
o 'j'jC j jo'^GTGGrG . G G T T - w w u A.oG ^ .^.o ^ o G „ _ v. _ ^ ^ . w . ^ i w ^ [ j 
A ,', 3 V 3 3 3 G V 3 G 7 G A Y. 7 1 
_ ? GGG.^j.-.GGG G3-7GG 3 PGGGGTGAGGGGwGGGA-'^ j.^. .jGG jG.G^ ^ ^ ^ . 7^-, ± b 0l : 

L 3 3 A P P A Q P A E 3 A Q P 7 3 
■^AGGG GGGTGGTGG GGTGGGATGTGCTGCGGCT'-;G7 G.AT.-.TG-joGG.AhG.". 1 c 50 

7 ? 7 7 A S D 7 3 P 3 7 I G A K 
~ ^ -^f\' j'.- G GGGC 3 7 ^ .-.3 G- A.' *.G A.^. ^rLr-. v3.".v ^ - - - ^ ^ ~- 'o 3'- w . „ vj i ■ J u 
45 3 3 p 1 7 P H 3 3 P 3 P A V 3 A 

-3 - - w G'-^ovov^: -j: ^ j> j.-'.T A7 AG o-; oG7'o - 'o--:^-/-\ - '-o.*-. .-•j^ i jo ^ ^-j - ^ l ' 5 U 

A S P G A 3 I P A 7 A 3 7 L A '7 7 
GuG7GGGTGT7GGAGGAGGGGGGGG7.~vG7G'^7 - G.". -.-.'^ „ * G-. 1 5 C- 3 

P. 3 7 P 3 H R A V L L G 3 D 7 
50 GGGGAGGGGGG7GG.GGGAGGCCP-.GG- m -7G'w7 , j71 To — -7 GT'~ _-w ^•^G^.-^o i 8^0 

7 G 7 A '3 7 G P P I 7 A 7 P P G 7 
j G G 7 G G G A G T G G G 7 G G G G 3.7 G G G G A G 7 G G A G 7 G G G G G A 7 7 ^ G 7 G G G 7 ^ G T 1900 

G W A W 7 G H G S A L P. 0 G 3 V V 
G77GGGGGAGGGGA7GGCCGAGTG7GGGGGGGGG77GGGGGAG77CG7GG 155C 
55 3 A P R :■: A G G A A A 3 P R P' 7 

G7GG G.A7CTG77 3.3G GG77G7G o^3i GGA JGG-G'j'oTG'j T'jo.-G^o GG77 200 0 
: W 3 L r 7 v 3 D 3 ? A \* V 3 P. 7 

— ^. ^ ^ _ ^, „ m ^, ^. ^ , ^ ^ rT - ■ ^ ^ /-> - " . r« - — — — - .— — — — — r - . . .-n ,^ - ~ 

D V '.' ~i - s W A :■: '-' S L .-. A v 

.jGG.AGGGGGGG 3 37 37 , .3Go'-7-^ , v , ovj."-. i ^'-\jo - '-=A : ^ jj^Cr. . ^ ^^^.n^'j 3xC0 
The NheI-XhoI restriction fragment that encodes module 8 of the FK-520 PKS with
the endogenous AT domain replaced by the AT domain of module 12 (specific for malonyl
CoA) of the rapamycin PKS has the DNA sequence and encodes the amino acid sequence
shown below.
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r r: : - r ? ? ~: 



3 F S ;' G ':' G T G 

. . I.- . „ ... o ^ ^ C u - -j C T o r.cA — ~G. o 7 ^ T o G - G G o _ „ ■' z G 

o : A T G S Q ^ G V ^ 2 G 

3 G G V P V G L E G F A V 3 V : T 

. •■ w - o . , _ - v . ^ G i Gu . j^C'jCTG'^r.^ ^r-G'-j^ w \z j'.ivr.oTCv; - * ... ^ 0 Ij J 
. - ^ _ o ^ >o — ~' - 1 j^-. ..GCTC.:.. 'v.- v T Gt 7 ; .- cGGo'j . oT ^ o \J * w . ~ ~ . . r 1 0 0 

25 3 G R G G G A L 7 G G G' 7 '/ A 

- - ^ \j v. vj ■ j _ . ^GTvjvj.-. oG"* - ~ - CCC jG ; w o 'o ■ ; O - i. G w Lj ^ x.- w : -. ^ ^ J 

: G G G G' G G S R Q G G G A F G 
-■ ■-■ G j' j o w j'jG-jGT C G _ o ^ ■cGGVii'wGur. ^ Go _ ^ w ^o^.G j. i G-o^C jr. - J 0 j 
F: A r.' G. f G A G A G G I S F ?. G 
30 GGGTGGCGGTGGGGTGATGGTGGAGAGGCTCTGCGAGGCGGAAGGGAAGG _0 50 
G A G 7 L I \ ! ERGS 0 A G F. !; 
^ . 'wAx-ACCuTl GGGGGGGT GGT GGGTGG TT GG o G -oG TGA.AG w.-iGi;.-. . u „• . l -J- 
G H T V L A v V R G S A 7 M G G G 

GTGCA-ACG-JG G GG GTGGG GG GGGA^ACGGGGGGGCG GAGGAGGGGGT GAG 1 ^ 3-0 
35 A G G G G G A P N G F S Q G ? 7 G 

- x. jv. ki .'.;G ^. G t G^'j GC AGGC GG G Go GCTC .AG^ GG jG^ 'ooA^ul uGa^ G _ ^ ^ i £ 'J C' 

G ^ A G A N A G L G F A D V D A 

. ~ ^r^v- 1 ,: G G x.-.~ v_ ^ ^ ; GA^C GG«.: ~-t: GGA^oGCTGvjG^o.Av^Gv— A-T^br-uoL-A^.-,.- ^.3 3 J 

'.' b: ~. t c- ? p. i. G z ? : e - ." 

* oG^ GG- GG A ~ vj'^^C.'-'j GA^tj^'o Go ■w^-.^G ^ GGGTGG7GGGG3G G50G 
A 7 I, A 7 Y G Q G R A T F G G G. G ; 
GGGGGTGAAGGGGA^CATGGGGGAGGGCCAGGGGGCGTGCGGGGTGGGGG 1 3 G G- 

3 G ^ K S i: I G K A Q A A S G 7 A 
j -,-.G Gr-.TG rAG - xjGTGGA/G G GGT GGGGC^AGo'o'o^.^-o^ TuC ^ w * -j 0 'j 

45 G I G K :■; 7 Q A L R H G E L P F 3 

x- . j ^ r-. G G G G G G ..; .A G G G G T 3 G G 3 G G A* C G T G G A. 3 G G G/-. G >o „ ^ o i o C G ^ ^ . _ 4 3 3 
.". n A G 1 3 F S F H 7 G W T A G A 'G 

... - j .-GG „ ^ . o.-^-_ ^- G'_ u'j GGG'~_- „- w v. oTGGG : jr.ijnv'v ■ „Cv;'j..i,r... ^'v: .. _ "j ' J 0 

3 G L 7 3 A 3 F W R 3 T G F. F r 
50 - o'^G'a ^ oG F _ ^G^'G ,j7T „'o G'o'jTuA'oGGG w". - ,„.^.A „ i ^ : _- ^ - ^ 

3 A A G' 3 3 F G G' S G G I) A. H G' ; 

- - ^o.'-.<j«jT GCc- jA 3 GGGTAGAGG G A. G A CGCCC'^C GGA J" ^oCCTT v ^'o ~ . - ou 'j 
- 3 A G F 7 7 3 7 ? A A G F S G G 

- ~ . G ... ^GGGG'G ._;G FGG GGGGA-.GG G7GA.GG r oG^^GCo C i C^A.^'or-.o--/-'--; _ o ^ 'j 
5 5 3 F G. G 7 G A R S F 3 A 3 G 3 3 

7G3GG3 3AGGG3GGGGGTAGGGGGAGAGGAGGGGGGACGTCG;,CGGGGT3 l^OQ 

- 3 3 G F A V 3 3 7 7 ? 0 V C P. 7 

A G r A Q T L A R R T K F A H R A 7 
60 G37GGTGGG7GAGAGGGTGA7GAGGACACCGGGGGGGGACGGGGGCGA3G GS0O 
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: . . _ or. . 7 _ r. . _ " : or. - — : ~ _. r. - - _ 7 7 . ..: 7 ; . _j .". . 



■Or .;. 



- .7 ^ s o o . .j . . _ .7 ..: j . , ._; O - „ 7. . - 7: or. . ,. . . - _ r. ^ - „ J. „ 

. _ o . — z .2 _ ■„- o oo o . ..ir. -~ r.o . _■- .7 o . .j or , . : ... ^ ^ 

• -"• ' - - - - i ■ - - ~G jo - . - .iJr.Jjr.. vj.-vo'j-w - . • o - - ... - J -'■ : or. Jl _ -J F 

>: v a v ? v 5 £ a £ a f a '-■ :. £ £ 

■J o ~ v; . ^ jr.'j.-.TC oG „G? o J - 2 ro-.G o G C G G o A - ;o . o -"; .... ^ A ^ ^ J 2 -L D J 

•'.'j 7 F 1 A A V N G P G £ V 7 £ £ 
— 0 7. j A T ^ - j ■ j . C o " " ^ 7 o C " v A 3 o C C u C G ^ - ^- G ^ j . - oGoj— : oor.£o 2o-37 

G c r. _~ \ o r. r ~ 1 j o :' 

~ j ~ . o „ _ o o O ro o - oAA ~ r- - G O roo ^ ^ o _ r. * ooro\ ... . _r* - o w - J J 'J 

F. A A 7 S H A F H 3 A F :■! A F M A 
j'-i ".'j or. j . , v.. c j^^'jGT ^ j _ ^ G "«~i^vt v.. C i G,~i „ „ . «'j'.j,~\ .: . ._ j'„>,Gv 2 i 00 

25 ^ £ A F. A V A £ G A 0 A = T ? A 

- ^ A'^C-. i GG A AGG T G.AT-..AG jTo.ACC^-.GG - - , - I ^O 1 ^ ^ AGG ^ 4 "■ t.. 

S A G G 0 v - ; A T .A A G F 

'v. T „ .rjvjnL r.CGGTC C TT - o'jCuAG ^ A'.: 1 ,; . j^~^A-^ . ^ j.^Ow.-* 2-0 j 
■:. A 0' T V P. F G E Q '.' A £ A E G 

30 . . jO ! - ; jTGT7CG ACGA.GGA GGGTGG GGA.CCGGA G.- : .G - ^GGG _G ~ AG^TG-o 2^DU 
A V F 7 A G G A 0 R £ A A A G V 
AGGGAGA 0G ; A G G A C G AAA TOG A GGC^GZ G A A C 

G G A A y. A H G A H Z G 0 A A 1 G 
- GG . ^ GAC JAGTA.AGAv_.AlA : w>oG-^-'oTCAC^jo - ^ j.-.CT-j .j ._ - -GGG^T 2 6 !j0 
35 A A A H A V "i G \' A '/ G/ A G A G 

^ ^ ^ „ ,^ _ ■ r-, y-. ^ ^, t. ^. - — ^ ^ ^ ry. ^ - — ^. . . ^ - '~ - - ^ <~ .~x ^ ^ ■ 

— - . o o '.j ^ ^ ; . \„ j. _ o o ^ r~^" ."i ^ j l o w _ o ^j." > - w or. _.-.-.*-. ^ o w „ „ < ^ j 

G G G A r A A r. V AGG F- G" V A 
. - o .... ^-.-.'^G'^ GT.AGA'^GoT 0 oA.oT v_oooA'^. o _ ^. ^ -;o^.'... ^ . wGo.--*- £ .■ G ; .i 

~ - ; y :■: l e s .-. ? ? a a 5 : 

G OGG'G G G'GGGGAG-GG AAkTCGGG'^GG G oo ^ . ... ^; 0-^oGG ^ c; C' 0 
A G ? 7 L G 3 G I A A A G £ p G 

OGGGGTGATGAGCGGTTGeGTGCCGACCGGTGOGGAGGGOGOGG AOATOG 2 8 50 
R 7 A A G £ V P T G A G A A 7 F 



2 300 



45 A A L A L A A A D A 7 £ C A A V 

Gi.AGGGG G ACG.AG.ATGGCG AOOGG'GCGCGGGGGG GGG joC ; w - G o G GGGG.AG 2 95 0 

£ P G A I A G '/PGR? £ :: 3 A T 

o.' ■. _^ „ GT." -mG.^.OC AG GG AGGA.GG.--G G'-uoCGbn^ j^-^o ^t^, ov^ooT 3000 
T 7 0 T W •.' P E r A D P G ':■ ?. R 

50 ::ACCGTPrACAcr G G G .A 3 G o o ^ o GlCG'^v! . o ^ ^ ^ - o „ o ^. o r. o j 0 3 0 
F T 7 H A A G G D A P ;G £ £ - A £ 

j! « G o i o ~. A 'o C o C G G C G .A A o o - GGwL i G C o o o .". , ^ ^ o o ^ ^ ^ ,-. _ .j ^ ^ o ^ ■. o 1 C> J 
£ V £ R P H A T A*. L F G A A £ A F 

.j : ■. j .j ' „ _ £ G G j j G G ■„ o J i '-j ^ - o ' J o u o o _ . j „ j t: ._- . ; . ^ ;G-j„ J £ 1 1 

55 ■-; £ G ? G A 7 £ A 0 G £ ? £ '* K 

-~ - _oo\j GA-.GoTG AAA ^'j.^jGl^u.HoG ^ oo.-.--'o'or-.o ^ ^r.^GuG 32uU 
r G G G 1 G: '/ £ A £ A E */ £ G F £ G 

- 7w£'Go.; - oG.-. G £ £ G G A ^ GG 1 ^ ^ ~~ r.^'oCuG G 'w . - _ - - ^ o _ . - jjGu.-. jG^'J 
F 7 7 H P D £ £ D A V F £ A '.' G 0 

6 ( J £ G G AG*. G G G G G G A G G G G G G G G G .A A 3 G C G C G A* C C T G A G G G A G G A G G G G T C G 3 3 30 0 
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7 H 7 7 1. H H ? H 7 7 ? L ■ 

- j ■ j ^ o ■ j _ i — C vj ^ .-i „ „ „1 ^-o wCbGCATCC * v.- o ^ „ o i 

■j S <j . L A G i r^. H _j 

3 R T P P ? 0 A 



? : D 7 G D P ■: L A ? 7 

'77GG777V777777ACCGGCA7P7T7 AA7ACCG7 7777.-77 ^ 100 

;GCA7777::A7G7CC7CA777:C7A:7G7C77A77A7:77 h 150 



40 7:;:A7 7 7 777GA7AA7GAAA7TA7G77G; 



t V. V; o i\ I - K ^ L, ^ i" 

i T H F H 7 L G Q ? A 7 : 
.TGTGGCAC 

45 W G M W K 7 7 S T L T G Q L 7 
GAGCGGGACCGCA7C7G7CG7GGCGGT7TCC77C: 

0 P. D F. I R P G G F 7 F Z 7 F- 7 7 
j w *v ^ A I *j 7>Vj: oriT G C A,7 

50 

The NheI-XhoI restriction fragment that encodes module 8 of the FK-520 PKS with
the endogenous AT domain replaced by the AT domain of module 13 (specific for
methylmalonyl CoA) of the rapamycin PKS has the DNA sequence and encodes the amino
acid sequence shown below.

^ D .-vjATC - r7r. j 777 77 7G.-vA.G 7GCTGCTgA.C<o _TGG — ^ob'jn^r.jCn-^ o 
v L A F A L T L V P 7 3 T 

G 7 7 G C 7 G 7 G C T C G G C 7 A 7 7 7 G G G T G G G G AG G AC A 7 7 7 - C G G G A 7 G G w : ^ • j 7 1 0 U 
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3 V W ?. F 



A 3 A 3 7 r.. 1 G ? D G F '.' 7 3 - I 

I 0 : -.; _ _ _ _■ _ j: „ . „ ^ . 7- .''...'..'o'-'JoT ./vi ~ _ .*-. " '_. -J J J 1 v ' 

i- 7 7 r a v g r g g f ; f a 

-J .j.-i . j „ . _ „ _ J : ■. ' J c i „ [ o i 'J .;^ ^ — ^ ~. 'f J \, 

', >: 3 7 7 7 ri A 7 3 A T '.' 3 7 A 

- , . : „ _ ^ o -~. 3 'c: 'O J'_ - 1 o'J ■-. - - - -J — — -J . - .. .1 S. 4 ^ 

15 3 r f p. 7 3 g a :•: g ? A a 3 ;: 

3 A. . . r 7 3 7 A E A 7 1 3 . . 7 

..: . _ ^; . , ; 3 7 \. - /vo 1 V o'Js.". - - - — - - ~ - . - - . n-- J 3 Z> J 

6 z- 3 Z 3 3 3 3 3 o n r 

20 33 333- w 3773 33 3A.GGG 3 3737 A.CoA^GGGT'j.-u — . ^ _ ~ c . j.^ j ( j 

3 A '.' A 3 A 3' V 3 3 3 3 3 7 

3. : - a a h r 3 ? 3 : : :■ : r 

25 A H 7 3 A T ?. 7 3 7 A 3 3 3 3 7 

3 A. 3 3 A. 33 3 A. 3 3 A. 3" A. 3 3 C 7 3 A. 7 C G 7 C C A. C A 3 3 A C 3 .-- . -~ ^ ^ „ ^ . ^. ^ o . ; „ j> . o ■ ^ 

7 7 3 3 T 3 I V K 7 7 T 3 ? A G 
C 3 A3 C G 7 7 A.C 3 G G 3 3 7 3 A.3 CCGC AC CG C 3 3 AG AAC G A_A3 A 3 3 3 3 3 A3 3 3 3 38 00 
A 7 7 7 3 3 7 R T A Q N 7 3 3 » 3 
30 A7G3G3 373A7 3GA3^AC3GACCA3C333A3A33G333733 3 3 37GGG33A 33 50 
I 3 3 I 3 7 D H P H T P 3 ? 3 A Q 
AG7CGGGA3C37GGACCAGCCCCACC7CGGCC7GAGGGAGGA3A3G:73G 3 900 

3A733HFHLR37H3 7L 
ACCA33333A3 373A33333373CACACCA33A3333A3 33A3 3A33A3 3 3 953 
35 HH3H373LKTTT37 7 77 

33 3 37 3 A~A 3 33 3 GAG-.C A 3 G 33 A7 V C A.7C A.73 -.33 -:G ^ o - ~ - - o - _ - - - 7 4 0 3 3 

F 3 3 3 3 3 A 7 I Z 7 G G 3 3 7 3 
■^^G 1 — .j*^ - .-77 377 3 G 373 G 3 3.-'/7^7'j.A;-'. _ 7.-. „3'„ ^ „ - . . . ^ - - ^ . „ - ; i U" 1 .' 

A G 3 3 A. F' H 3 N H 3 3 . 

40 3 33G3AG373A3 3 33333A3GG3A3G3333G3A33 3A337 33 333 3 73A3 ^ 1C3 
3 3 7 3 F 3 3 A 7 3 G 7 3 3 3 3 3 
G73GG33A33333A33AA373G33A33A333T3A333A.3A333333AA33 .4130 

7 G 3 3 3 3 3 A 7 7 3 7 H 7 F 3 ? 
7777A3GGG3A737733A3AGCGCCGG3AC3G7CGA73GA3G0GA7G37GC 4 200 
45 3 3 A I 7 H 7 A A 7 3 3 3 G 3 7. 

AGG3G370A33GG3GAG3GGC7GACCACCGTCCTGGACGGGAAAGGCAAG 4 250 
H A 3 7 ? D 3 L T T V 3 n 3 3 A 3 
GCCG3G73GCA7G7G3A33ACCTGACCG7GG3ACC3 ' 33 3G7GA333A377 4 300 
A A V; H 3 H H 3 T C K Q P 3 7 H 7 
^ 3G7337 37A.3733.AG3G33G3GGCCG7GC7CGGCA.G333 3 GG^.33_-GGAA. 4 i ^0 
7 T 3 7 3 5 A A A V 3 G S F 3 3 G 
A 2 7 AG G 3 3 G 3 3 G G 3 AA 3 G 3 G 7 7 G C T C G AGG G GC 7 3 G 7 C A G 3 3 3- 3 3 3 7 3 A 3 4 4 00 
3 7 A A 3 A 7 3 D A 3 A 7 3 F. H 

A 3 3 G 7 3 G 3 3 G 33 , C G G G G G A G 3 7 G C A TCGCC 7 G G G G G A T G 7 G G G A 3 A G C A C 4 4 50 
55 7 3 3 3 3 A 7 S I A W G 3 W 3 7 7 

G 3 3 3 37 7 7-. 3 3 G G 3- G .A A 3 T 3 G A. G G .A C G v, 3 1 j 3 3. - j ^ ! . — . v - _ o 4 o 3 0 
3 -3 3 7 G 7 ! 3 7 A D 3 7 - 7 F 



7 D D E G 



60 
Phage KC515 DNA was prepared using the procedure described in Genetic
Manipulation of Streptomyces, A Laboratory Manual, edited by D. Hopwood et al. A phage
suspension prepared from 10 plates (Ljo mm) of confluent plaques of KC515 on S. lividans
TK24 generally gave about 3 µg of phage DNA. The DNA was ligated to circularize at the
cos site, subsequently digested with restriction enzymes BamHI and PstI, and
dephosphorylated with SAP.

Each module 8 cassette described above was excised with restriction enzymes BglII
and NsiI and ligated into the compatible BamHI and PstI sites of KC515 phage DNA
prepared as described above. The ligation mixture containing KC515 and various cassettes
was transfected into protoplasts of Streptomyces lividans TK24 using the procedure
described in Genetic Manipulation of Streptomyces, A Laboratory Manual, edited by D.
Hopwood et al., and overlaid with TK24 spores. After 16-24 hr, the plaques were restreaked
on plates overlaid with TK24 spores. Single plaques were picked and resuspended in 200
µl of nutrient broth. Phage DNA was prepared by the boiling method (Hopwood et al.,
supra). The PCR with primers spanning the left and right boundaries of the recombinant
phage was used to verify that the correct phage had been isolated. In most cases, at least 80% of
the plaques contained the expected insert. To confirm the presence of the resistance marker
(thiostrepton), a spot test is used, as described in Lomovskaya et al. (1997), in which a plate
with spots of phage is overlaid with a mixture of spores of TK24 and phiC31 TK24 lysogen.
After overnight incubation, the plate is overlaid with antibiotic in soft agar. A working stock
is made of all phage containing desired constructs.
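
The cassette can be moved from BglII and NsiI sites into BamHI and PstI sites because each pair of enzymes leaves the same single-stranded overhang. A minimal sketch of that compatibility check, assuming the standard cleavage positions A^GATCT (BglII), G^GATCC (BamHI), ATGCA^T (NsiI), and CTGCA^G (PstI). The table layout and function names are hypothetical.

```python
# Recognition site (5'->3') and top-strand cut position, counted from the
# 5' end of the site. All four sites are palindromic.
ENZYMES = {
    "BglII": ("AGATCT", 1),   # A^GATCT
    "BamHI": ("GGATCC", 1),   # G^GATCC
    "NsiI": ("ATGCAT", 5),    # ATGCA^T
    "PstI": ("CTGCAG", 5),    # CTGCA^G
}


def overhang(enzyme):
    """Return (overhang type, overhang sequence 5'->3') for a palindromic site."""
    site, cut = ENZYMES[enzyme]
    length = len(site)
    if 2 * cut == length:
        return ("blunt", "")
    if cut < length - cut:
        # Top strand cut left of centre: 5' overhang.
        return ("5'", site[cut:length - cut])
    # Top strand cut right of centre: 3' overhang.
    return ("3'", site[length - cut:cut])


def compatible(enzyme_a, enzyme_b):
    """Two ends can be ligated if overhang type and sequence both match."""
    return overhang(enzyme_a) == overhang(enzyme_b)


print(overhang("BglII"), overhang("BamHI"), compatible("BglII", "BamHI"))
print(overhang("NsiI"), overhang("PstI"), compatible("NsiI", "PstI"))
print(compatible("BglII", "PstI"))
```

Because the two ends of the cassette carry different overhangs, the insert can ligate into the phage arms in only one orientation.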

Streptomyces hygroscopicus ATCC 14891 (see U.S. Patent No. 3,244,592, issued 5
Apr 1966, incorporated herein by reference) mycelia were infected with the recombinant
phage by mixing the spores and phage (1 x 10* of each), and incubating on R2YE agar
(Genetic Manipulation of Streptomyces, A Laboratory Manual, edited by D. Hopwood et
al.) at 30°C for 10 days. Recombinant clones were selected and plated on minimal medium
containing thiostrepton (50 µg/ml) to select for the thiostrepton resistance-conferring gene.
Primary thiostrepton resistant clones were isolated and purified through a second round of
single colony isolation, as necessary. To obtain thiostrepton-sensitive revertants that
underwent a second recombination event to evict the phage genome, primary recombinants
were propagated in liquid media for two to three days in the absence of thiostrepton and
then spread on agar medium without thiostrepton to obtain spores. Spores were plated to
obtain about 50 colonies per plate, and thiostrepton sensitive colonies were identified by
replica plating onto thiostrepton containing agar medium. The PCR was used to determine
which of the thiostrepton sensitive colonies reverted to the wild type (reversal of the initial
integration event), and which contain the desired AT swap at module 8 in the ATCC
14891-derived cells. The PCR primers used amplified either the KS/AT junction or the AT/DH
junction of the wild-type and the desired recombinant strains. Fermentation of the
recombinant strains, followed by isolation of the metabolites and analysis by LC-MS and
NMR, is used to characterize the novel polyketide compounds.



Example 2

Replacement of Methoxyl with Hydrogen or Methyl at C-13 of FK-506

The present invention also provides the 13-desmethoxy derivatives of FK-506 and
the novel PKS enzymes that produce them. A variety of Streptomyces strains that produce
FK-506 are known in the art, including S. tsukubaensis No. 9993 (FERM BP-927),
described in U.S. Patent No. 5,624,852, incorporated herein by reference; S. hygroscopicus
subsp. yakushimaensis No. 7238, described in U.S. Patent No. 4,894,366, incorporated
herein by reference; S. sp. MA6858 (ATCC 55098), described in U.S. Patent No.
5,116,756, incorporated herein by reference; and S. sp. MA6548, described in Motamedi et
al., 1998, "The biosynthetic gene cluster for the macrolactone ring of the
immunosuppressant FK-506," Eur. J. Biochem. 256: 528-534, and Motamedi et al., 1997,
"Structural organization of a multifunctional polyketide synthase involved in the
biosynthesis of the macrolide immunosuppressant FK-506," Eur. J. Biochem. 244: 74-80,
each of which is incorporated herein by reference.

The complete sequence of the FK-506 gene cluster from Streptomyces sp. MA6548
is known, and the sequences of the corresponding gene clusters from other
FK-506-producing organisms are highly homologous thereto. The novel FK-506 recombinant gene
clusters of the present invention differ from the naturally occurring gene clusters in that the
AT domain of module 8 of the naturally occurring PKSs is replaced by an AT domain
specific for malonyl CoA or methylmalonyl CoA. These AT domain replacements are made
at the DNA level, following the methodology described in Example 1.

The naturally occurring module 8 sequence for the MA6548 strain is shown below,
followed by the illustrative hybrid module 8 sequences for the MA6548 strain.

G C G G C C G C 3 C T C G A C G A CGC3GCGGACGTGCC3CTGCTGCGCGGGCTGCG 10 0 
35 A A A L C D A ? C V r L :. ? G '.. ? 



SUBSTITUTE SHEET (RULE 26) 



WO 00/20601 



94 



PCT/US99/22886







- - : - - .„ v. ,. .7 J j; .:■ „ ^ .. . j: . _ ' _ ~ o - : .: - -J - I Z L C 

r _ : r z L: r j . r : . w . . ■; : . 

- - - J - , _ _ „; ^ .j j. ^ ^ j: -v^'j'^ .-v._. i w .J .* -„ „ j.- ". .-.".J.-. . ... ._: ..r _: _ ... ^ _ ^ 'J 0 

A A 3 : _ ^ . r. L 

J ^: _ t — . „ ^, * CG.-'.Cr'. - _ — „U ^ -J ^C-^r - — ^ ^ o ._: _ _ - "_r - '~J ^ 2 ^ ^ IJ 

A A G 1 0 7 3 ? G '■' 2 r A '." A 

jrr „ x ^ j; w G oGC'J i ^vO ^ -~ ^.TTCrAv. ^r. ^'v. ^ ^ o „ 'w \j . . ^: — . 'O C 2 0 0 

A. 2. ?. 0 E 7 G E A ■' 1 A G 

- : --'ji v..A o ' j ^ j C . w — w — G ■ j ^ .~ - ~ '-3 - — - ^ - ■ ~ r-- -. ~ - - T C T i d J 5 u 
2 7 I G A 2 r A A A A. D E J. V F 

^ - . 7 CCvj: G7CA : j<o^ w _r.J l -A? w ^- - „-." - .j '-r o — _j .-. _.-..-'.■>- 1 ^ I 0 0 

v v i: g q G ? ^ h ? a :: G h: : :. 



— 3 'v.. _; .-.C^,;vj'j - ^:G - ^ ♦ i" - - v '- . J^'^Ci \~. i. vj.-.-j *J o '.J ~. -■ ' -^S". ~ ^ ^ ^2^0 

A H A A A T A L L t< 2 A G : T 

^ w.-. _ ^ ^ ^, i.j 7 G .~, TC'wwv'v,.".. . . j^TCG^C'j :j .A . ^- -**. ~ - ^ - * ~. v z - o 
r r. A v' I G r: G L G c . .-. : J*. 

2 ~ G-^GATCCTGGCGGTGGAGGA.CGCCToGA.CGG - _—TG.-C j.'jTG^ _ ^d'j 

30 ag:gsg:gactgtttra 

gggggtgatgcacacggttggggcgccgggcgcgatggtgacggtggtga 2 4 00 
2. a m h t g ? f f g a x v t v l 

CLno G'o^GGAG^jA^GG GGGGT •-.-.GGv_-Gv j . TvjGGGCC - o-o Co TGGAGATCGCC 2^50 
T 3 £ E E A H Q A L R ? G V E I A 
3 5 3GGGG GTTGGGGGGG G.ACTGG GGCG7GGT GTCG G CGAC-j.-- : ~ ..-.-CGCGGT 2 z C 0 

A •/ G G ? H 2 V V L S G A E 2 A V 
GGGC GA.CGTCGCA.GAGCGGGT GGGGA.GGGAGG.AC' GGGCG^ GG G 2 2222GZ ta ^ 5 0 
G 2 V A C- 2 2 G : H H r G ? A G 

j-. ... ^ _ o ^ Gr 1 . G G G 2 G j ._ ^ ; - L ^: i ^ -> - — - ~- '* o ----- o ^ — ^ C 'J v 

40 H A G H S A H E 2 V A A 2 G G A 

A 2 2 2:2 2 C 2 Z G A. G G T G G G G G 2-. G G 2. G 2 G G G G G 2" A G A* 2 G o C G ~. T G „ ^ .-.AC G A. 2 o 5 0 
2 2 A E G 2 V 2' R F H G A 1 ? N D 

G C A G G A 2 2 G.A.A C G G G G A G G A G T 2 7 0 0 
2 G A A E V A E A V A N 2 7 L 

45 A G GAGGCC GA.CAC GG.AG G 3 2G AG 2CCGP-.22 2 222 2 . Aj^ A^rA-„ ( ju^ <_ -^u 

G H A H G A 2 2 ? 2 A 7 2 2 2 G 

G G G G G CCA GGACCTC7 GA 2 2 3 2G 3GT CG2-.CG GCA. - 222 GG G A A; j.^v22w ^ -r' u Ci 

? GAEL c 2 G V A G I A 2 A U G 
2GAG G GGGGACGAGGTG GAGGG GGTGCA.CACCGCGGAGGCGCGGGTGTGCA. 2S 50 
50 G A D E 7 H A G H T A A A A A E 

GAG G GGGCGCC A.CGCTCG AGG 3 GCCGCGG AT CCG -ojCoG — - ^ uGC'jb 2900 
A r G A T G G A S A I A A G A S R 
CG^GAGCCTGA.CGACCCGE GGG.AGGCG77CCA.GC w o ^G - G G . .-^TOb.-A 2 950 
H G P D 7 P S : A E Q ? P r '( V, I 
^ 2> C^j.A j;7 o'oCTCGCCC o o 2 G.". - -~ C G^jA.CT C G G'_; x_ — - - - '~o^^A /;U'J0 

G 3 A 2 F A G A C 5 G 2' E '.' G G 
GCGGAG7CGCGG7CGGGGGGGGGCGGGGCGGGG7G77CACGGGGGCCG7G 30 2 0 

. ^ r j; r. ■ - - r v 

CGG GGGGG7 GGGGA.GC GG GG 3G7GTTCATCGCCGAA.CTGGG ^ G7 „ GCCGC 2 1 0 0 
60 2 A G A G :■ A E I A E A A 2 A A 
The AvrII-XhoI hybrid FK-506 PKS module 8 containing the AT domain of module
12 of rapamycin is shown below.



;ggagga:g 



'GGAAGAGGACCGCGAG; 
W N S I 1 A T 



250 



30 



jo 



40 



45 



DO 



3} 



a G 'j-nGC/'jo G G G G r r A, C G G G 



G A A A H D E 



G T 0 A I T E 
AGGCGGGGTACGACCCGGAGC 
D A L V D ? D 
,, A GGo w jo GTTGGTGGA.GGG7 

I S ? R E A I 
. G ncA G T C G T G G G A G G C 'o ^ 
LETS W £ A 

G~G joGlA^j G0AGACG0G - 



o o o -> w- I - o - ^ ^ o r-. j : \ 
* ;, T. A A R 



;::a 450 



A ? V A A R T A A 
jCTGGGGAGGGTGG GCATGGCCTGGCGT 500 

LAI V G IS! A C R 
:AGAG3AGGGGTGGGGTGTGGTGGGGGG 5 50 
- Q E L W R R V A 3 
_ OC^^uv/GGALCb'^'j'j^Tb V"j o A C ; o Too o 0 0 
F P A D R G W G V 
" 3 A G 3 0 GAG CGGC AA. G A C C 



_ O i ^ O J 

I G K T ? V R 

G G T G GG AG G G G G G GTT G T T G G G 

~ T n l ^ r r ~ 



65C 



K 



^ j o - \ . G u o T G C G G A T A G l AA 
G T G A D T :■: 
r o i _.GTG^ . ^'CCCGGCCGCCTCZ' 
3 V G S G R L 
' ^ T l, AG o o ? C ■ j mG A C C G G G 'I o 0 

7 T V D T A C 
A GG G G AG T GCG TGG G G T CGG G 
G G 5 L R S G 
. ~ A G G o T b A T G G G G T G G G C C G 
V T V M A S P 



J G AAA* G G G G G G G G A. T G A C G G G G G A G G G G 3 0 0 

G S A G I T ? D A 
" TTTG.'-.TCGGG^jCGTTGTC'wTAGou'oTAv 8 
; F I G A r s y G T 
ATGTTGGGCGGGACAGGGTGGGAGACCA 900 
GRGATGSCT 



. TAGG^Gw 3 o G A ; jG'jCCv'i T ^ ( 



* " ' o i_i L o 

^r^^r^rT,^,^^ G G ^ ~- n - ' 
3 S G V A L H 
-A.TGGGGGGTGGGGCTGGTGGGGGGTG 1 0 50 
; G 3 L A L 7 G G 
rGATGGGGGG AGT TCTCCGGGC A.GGGG 1 10C 
G r V E R S R Q R 



j _ j. j 'J t 



ooLj v.. : j L o ^ V/borll 'o vj'j ^ Vju 

G L A P D G R 

T S F A E G A 
AC G C GGAGCG C C AGGGG G AG A 
G A E R H G H 



A R G A G A D G 



G A G V V ERGS 
rGTCCTCGCCCTCGTACGCGGCTCCGCG 12 50 
G A 1j V R u 3 A 
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r r. r. ■<> v ^ ^ 



G A G 7 I G E S A F F ? 2 ? A G G 

_ o ... jr ^ : . _ ~ wU'^AoT'jj^ . u ^, - J » i-j^i j: „- .~v ^ C- v. 

^ r „•-. r r.. V7 V r 
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? G 0 G A I. G £ H E G P G ?. A ':' G 
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A A G ? G V D M R A V A S G G A 

wr.w'j'o i ^ ..3 i j G G C Gr.'j C."/-^^TGl CvjT^Jw G'^ : .jO'jriCjr-. . „ 1 9 ^ J 

G G G G E H G A V G G G G G T 

- G.~.' — oLtC^-.G GTuTGTGTGA.CCC rCuGGwo-^TG ± . C-^ i „ ^^^^j^.--. .■! 0 0 . 
V G G G A V S D P R A '.* G G P G 

G A G G G G T G G G A 3 G G T G G T GG G A 7G G G T G AG G AA G 7 G 3 G C G C G G G G G G G G G 2010 
■l; G S Q P. A G M G S £ G A A A P ? 

_ * ^ ^ ' j ; v. o ^:A- : G G A»T G.'-. jG.-.G'J T G T O ^3 v- . ^ - : - ^ „ „ G 1 ^ 

'.' G A G I K G Q V W G :. G P G P 

I G E '.' !,* G 7 G Y A Q P A G PA M 

... r- t ^G . ojG7G I G77GG GGG7GG7GGAA-TCGTGGGG7 ^G.nCG.^G ^.G-o.'-i-^^-G 2 2 C 

G Y A L G G G G E 3 W 3 7 P P G A 
j j ^ j. 'j; o ^,.-4 7 GG j - GGvj7GA^jG77vjGG . ^ ^ : A. 3 'o ^ ^ - ^oo^-^ ^ ^ ^, ^ 
*.' 1 G G G 7 G E L A A A Y 7 G 3 

GAGT7TGG7G7 GGG 3GGGGG GG G 37 GGG ^ 3 G G 
7 W SEE D A C T L V S A R A P G 
I G ^ .-.o\jC 7 G 7 7 G G G GGGGGT GG GG 7 G A.T GG T G G C . G - GG o ^ ^ . _ ^ 2 3 "d 
N! QAG ? A G G V M V A '/ P S E 

GG-A.TGAGGC GGGGG GGG7GG7GGGTGAGGGTG7GGAG.-.TCG GG G G GG7GA. 2 4 0 G 

3 E A R A -v G G E G 7 G I A A V 
.~GG^~ . G^7G'-:3 GGG7 3GTTG7GTCCG"sjTGA.TGrvGow .. _ ^'o . j.. . jv/G ^45*0 
G G P G G V 7 L3GDEAA7GQ 
^G — ; _ >o j^GG ; ^G7GGG'.j3^G7GGACo'wGG^Toc ^ ■ — - x i G d 0 

.A A E G L G K W 7 R G A G S GAP 
_ A73 n GCGGGGG7A.7GGA„AGGGATGCTGGAuGAGTTGGGG ! jG G j: 3GG - v. 'o I z -5 ^ i 
P G A P K E P 7 E E E P P A G A 

^ ^7G.'-,3 G 2 A G G GG.- GuGGGGA.GGT CT GG.-G GGGG cT i ^ o ~ - — . - ■■-.^ ^ c. ou ^ 
G G G G P 7 - Q 7 S f-! A 7 G 2' 2 

* _ ^ r-.u^.o 4 J.- jT.Gw'j G'o jTu^ufoi»_r-iGoT ^.-^'^o'^.-.'^/iG-O v.; - ^.o ^ .7 ^ o^'J 
'/ 7 T A E Y W 7 R Q 7 P. D T "2 P c 
3GGGGAGGAGG7GGGG7GG7AGGAGGACGGGGTG77GG7CGAGG7GGG7G 2700 

7 E . G 2. G ; EDA '2 E ': E G G 
C G G7-.C C GG 7 G AG 7 G G G GG GCCTGGTCG ACGG TG TCG G G AT GC T GZkCGGC 21 SO 
G p. £ L A P. E V D G V A. Pi 2- H G 
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7 2 v; ? v a £ a h v : 2 a : i 



: 273735 t p 7 p 2 - c : : : 

r- '." 1. 7 A L r J H H 2 - 2 2 

2 : 7 7 7 7 D 2 2 3 A A 7 '4 - ; 

J -J C G 7 A „ .3 3 AC AAA A 2' 7 AA.C AC CCCCC*-^ ^ -.-.7 — .-•-I — G „ .. - j 0 ^ 

T 7 3 A 0 N £ H ? G P I r. 3 : 7 ■; 
2GACCACCCCCACACCCCA2TCCCCCTCACC3AA373A33AC337C3A32 4 050 
20 H H ? H 7 P 7 P L 7 2 1 7 7 7 H 

AA03CCACC7ACGCCTCACCAACAACACC3TC3A3A33 3CCCAC3T3A5 2 4 100 

P H 7 7 I 7 N M 7 7 7 7 2 7 7 7 
22CA7CA33ACCCACCACAACACCACCACAACGA3C3CGAA3A3332A22 J 1 5 7 
p I 7 2' H H 5 7 T 7 7 7 2 5 7 2 4 
25 7G7CAAC3 3CAACCACGCCA7CCTCATCACCCGCGG 35 33 3GCAG3C73 3 4 200 

7 i; ? n a a : 7 i r 7 g g g 7 7 

0CGGCA7C77CGCCCGCCACGTCAACCACCCC3A3A33 7ACC7G37373 7 4 2 50 
A G I 7 A P. r. L M H F H 7 V 7 7 7 
G G C A G A C 7 A. G G A C C C C C C AG C A C A C C C G G C A C C C 2 A 7 G C C G 7 G G G A 7 7 7 4 300 
30 7 7 ? p ? F 7 T P G 7 H I 7 7 0 2 

72-X^CGAGGGGACCCA^7CACCCAAGCCCTCA2G2AGATACGA7AAC227 4 35 0 

TGP7QITQA77H ~ 2 ? 
77AGCGG7A7G r rTCGACAGCGCCGCCACCC7C3A223,2 7 7CA2 727GA2 2 4 4 07 
7 7 G : 7 H "" A A T 2 2' 7 A 7 7 7 

j> S 2G-.777 77772* 7 7 77A7\ 7A7277A.CC 1 ACGAi.C"^C i ■■ 2 — ^: . t j 

7 2 7 7 Q H 7 T T 7 7 3 r K A D A 
G 3 3 3 7 3 G 3 A C 3 7 3 3 A 3 G A 3 7 A 3 A. 333 .AA-AAG 3 AA 3 2 G 3 T 3 A 3 3 G A. G 7 7 3 3 ■ ! d J u 

A W 7 L M H H T Q N 2 2 3 7' r. 7 
77 32 , 37.A27 3 3.AG3G33G3333CAGCC7CGGCAG333 3 7 GGCA^A jCC---*-... -.zzu 
40 7 3 7 3 S A A A T L G 3 3 G C A 3 

7A3G33G33GCCAACGCCT7GCTCGACGGGCTG332A33CAGGG33A3A2 4 600 

Y A A A N A 7 L 0 A L A 3 H 2 h 7 
G 3 A-A G G 2-> G AA C C C G C C AC G A G G AT C G C C T G G G G 3 A 7 G 7 3 G 3 A. G A G G A 3 2 A 4 650 
Q G Q 2 A T 7 I A W G V, 3 H 7 7 
45 CCACACTCACCAGCCAACTCACCGACAGCGACCGCGAGGGCAT3CGCCG3 4 700 
7T3TSQ2TDSD7 2K:RR 
GGCGGC7T3C7GCCGA7G7 33GACGACGAGGGCA75: 
G G 7 3 3 I S D 3 E G [-! 

The NheI-XhoI hybrid FK-506 PKS module 8 containing the AT domain of module
13 of rapamycin is shown below.

'.j'77,73 33 G 3TGTACGA.GG2-wGC-AC.GGCGG.^'^^.G ^ _ ^ — ^ - ~ - - i z y. . 

'4 2 7 7 2 A A R R T 2 2 3 2" '.' : .' 
G C G G G C 3 3 G 3 T C G AC G AC G G G C C G G A G G 7 G C 2 G 3 7 G 2 7" G C G C G ^ G G 7 G - o . 7 J 
55 A A A L d D A P D V P 2 3 ?. G 3 ?. 
4 5373.3 3.- 3 3 37 3 3GG 3 52 w 2 2G^ v.o7 3 ^'ooor\.- ^ - - - - - - 

P. T 7 V R 7 A A V F, E P 3 I A D 
GCTCGCC G7GC7GCCCGACGACGAGCGCGCCGA.CGC27 CCCTC 3 CGTTCG 200 

R S 2 C 3 P 7 T 3 A P 7 t P S R S 
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. . jT'j i *„GoCGC G77 OCGGG'o.-v^ A-'~- : - - A j^n'j.-.TCGGG 3oG37 1900 



TTGCGCGAGTTCG TGGACTGGGA^TCTGTTCACGGTT 3TGGATGAT CG 3 2 3 22 CO 
20 2 R E F 7 D W D L F T V L D D P A 

o -jTG'^TG^ACGGGG - TGATiaTG'jTCGAGC^Cvj'-^TI - 'ouG'-^AT^r. , 22 50 

D P 7 D V 7 Q P A 2 W A M 7 
/T70 Z 27GGCC GG G 3TGTGGCAGG CG GG 3GG7GTGCGoCCGG ATGC ^G7G 22 u 0 
7 Z G A A 7 W Q A A G V R P D A 
25 ".TGGGCGATTCGGAGGGTGAGATGGGCGGAGCTTGTGTGGCGGGTGGGGT 2 2 5 0 
I G H S Q G E I A A A C 7 A G A V 
GTGA37ACGCGA7GCCGCCCGGATCGTGACCTTGCGCAGCCAGGCGA7CG 2 4 00 

2 L R D A A R I V ? L R 5 Q A I 
CCCG3GGCCTGGCGGGCCGGGGCGCGATGGCATCCGTCGCCCTGCCCGCG 2 4 50 
30 A R G L A G R G A M A S V ALP A 

CAGGATGTCGAGCTGGTCGACGGGGCCTGGATCGCCGCCCACAACGGGCC 2 500 

Q 2 V E L 7 D G A W I A A H M G P 
3 G C 3 7 3 3 AC C G 7 G A 7 C 3 3 G G GC A. C C C C G G AAG C G G T C G AC C A T G T C C 7 2 A 2 5 50 
A S T 7 I A G T P E A V D K 7 L 
^> 5 2 C 3GT2P.TG AGGC.A 2PGaGGGGTGG ^ GGTG CGGGu vj. 1 -. 2 w/vCCuTCGAC i .AT 2 603 
7 A H E A 2 G 7 P V R P. I T V D V 
22 272 32 AG AG 333 3 3 A3C7CGAGCTGATCCGCG ACG AACTACTCGACAT 2 650 
A 3 H T P H 7 E 3 I R D E L L D I 
37 ~.;;CG7v3A.GCA.G C2 1 CGCAGA*CCGCGCTCGTGCGG i ^GCTCTCGrv-CG 2 7 GC 
40 7 S D 3 S 5 Q T P L V P W L S T 

7GGA3GGCACC?GGG7CGACAGCCCGCTGGACGGGGAGTAiCTGGTACC GG 27 50 
7 0 G ? W 7 D S PLDGEY'WY P 
AACCTGCGTGAACCGGTCGGTTTCCACCCCGCCGTCAGCCAGTTGCAGGC 2300 
N 2 R E P 7 G F H P A V S Q L 0 A 
45 CCAGGG0GACACCGTG7TCGTCGAGGTCAGCGCCAGCCCGGTGTTGTTGC 2 8 50 
Q G D T 7 E 7 E V S A 3 ? 7 L L 
AGGCGATGGACGACGATGTCGTCACGGTTGCCACGC7GCGTCGTGACGAC 2 900 
<2 A M D D D V 7 T 7 A T L R R D D 
G G C G A G G 2 C A C 3 C G G A T G C T C A C C G C C C T G G C A C A G G C 0 7 A T G T C G A C G G 2 950 
50 G 2 A T P M L T A L A Q A Y 7 H G 

CG7GACGGTCGACTGGCGCGCCA7CCTCGGCACCACCACAACCCGGG7AG 30 00 

7 7 7 D W P A I L G T T T T R V 
7 3 G AG G T T C C G AC C T AC G CC T T C C AG3C AC C AG C G G T AC T G G C T G G AG T C G 30 50 
L 2 2 P 7 Y A F Q ;1 Q R ':' W 3 E 2 
0 0 o 2722CCCGGCCACGGCCGACTCGGGCCACCCCG7GCT GG'j^AGCGGAjjT J - 0 0 
A P P A T A D S G H ? V L G ? G 7 

A 7 A 3 S r G P 7 E T G P V P A 
G 7 G G G G ACC G C G CGG T G T TC A T C G C C G AAC T G G C G C TC G C CG C CG C C G AC 32 CO 
60 G A D R A 7 E 2 A E L A L A A A D 
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AA. Z C G 3 2 3 ■ j ■ j 



A V? T L H A E G V LP? G 3 7 F 3 

o - „ ^ o .-1.'- - 'o ^ ^ i G j w ~. v. o l. \w - j'oCClCCGvv-'o j;^ ~ j _ o G * ■ j ^ ^ — ■ j ? ^ s. ~t ~* v _< 

10 F E A V D 7 A W ? ? ? G A V ? A 

."■.^ijijuC i G --'^.CLj^G GCGTGGCGACGCGCGG^.CCr-.vj o i GTGC 1 ^ - z~ oO 

7 G L ? G A W R R A D A; y F V G i 

~ G S F G G F V AH? G LOG 

7 F S A 7 GOGS R Q P G G W r 
.^307CGCGG7GCA.CGGG7 GGGAGGGCG.CCGTG GGG GGCGCCT G J 57 GAG G 3 o b 0 
G L A V H A S D A T V G 3 A G G G 
- j-CG w j.ACAGTGGGGTGGTGGAGGTCGCCGGCGGGvj^-.^GGTG 1 ^ ~ jur-", r 3 00 
20 .-. R 0 £ G 7 V £ L A A F G G A G ■•■ 

ji.i-u'ji ijC?GACGGGG'o.--GTGoGTGAGGCTgGG'~o^'^G iCG.u. ^^-jw.-.-o ^ i> G 

P7GTAGGVTLGZ7A3A 
-CGGA.TCCGACGA/GGCGGA.CGGTCTGCTTCGGGTG GAGTGG7TG3GGG7G j- 8 0 0 
GGS D E S D G L L R 7 E W 3 F 
-3 GGGGAGGCCCACTAGGAGGGTGCGGACGAGGTGGGCGAGGGGGAGAGGGG 3 3 :>C 
A E A H V D G A D E L P E G 3 7 L 
CATCACCGCCACACACCCCGACGACCCCGACGACCCCACCAACCCOCACA 3 900 

I T A T H ? D D P D D P T N P H 
ACACACCCACACGCACCCACACACAAACCACACGCG7CC7CACCG0GC7G 3 950 
30 t-'TPTRTHTQTTRVLT A L 

CAACACCACCTCATCACCACCAACCACACCCTCATCGTCCACACCACCAC 4 000 

QHHLITTNHTLIVHTT? 
CGA33CCCCCAGGCGGCGCCGTCACCGGCCTCACCCGCACCGCACAAAACG 4 0 50 
D ? P G A A V T G L T R T A Q N 
35 AA C AG G G G G G C C G GAT C C AG G T C AT C G AAAC C G A G C AC C G G G .AC A G G G G A -1 100 
E H P G R I H L I E T H H P H 7 P 
CTCCCCCTCACCCAA.CTCACCACCCTCCACCAACCCGACCTACGOGTGAC 4 150 

3 P G T G L T T G H 0 P H G F G G 
GAACAACACCCTCCACACCCCCCACCTCACCCGCATCAGCACGGAOGAGA 4 2 50 
40 G s; T L H T P H L T P I T T A H 

A C A G G A C G A G AG\ C C A C C C C C AA CACCCCACCCCTC AA G C C C AA* C C A G G G G 4 2 50 
N T ? T T T P N T P P L N ? M H A 
ATCCTCATCACCGGCGGCTCCGGCACCCTCGCCGGCATCCTCGCCCGCCA 4 300 
I L I T G G S G T L A G I LARK 
45 CCTCAACCACCCCCACACCTACCTCCTCTCCCGCACACCACCACCCCCCA 4 350 
LNHPHTYLLSRT PPPP 
GGACAGCCGGCACCCACATCCCCTGCGACCTCAGCGACCGGAGCGAAA.TG 4-^00 
TTPGTH I PCDLT DPTQI 
A G G C A.A G C C C T C A G G G A C A T A C C A C AAV C C C C T C A C G G G C A T C T 7 G C A C A C 4 4 50 
50 7 Q A L T H I P Q P L 7 G I F H T 

3 G G C G C C A C C C T C G A G G A G G G G A C C C T C A C C A A. C C 7 C A G G C G G C AA G A C G 4 500 

A ATLDDATLT M 3 T F Q K 
7GACCAGCACGCTCGAA.CCCA.AAGCCGACGCCGGG7GGGAGCTGGAGGAG 4 5 50 
3 7 7 T L Q P K A D A A W H L H K 
55 3ACACCCAAG3ACCAACCCCTCACCCA0TTCGT:C7CTA7TCCAGCGC0GC 4 000 
A 7 0 N Q - ? L 7 H F V G 7 S S A A 

AG GGG PGQAN 3 A A A Li A 
7CCTCGACGCCCTCGCCACCCACCGCCACACCCAAGGACAACCCGCCACC 4 7 00 
60 FLDALATHRHTQGQPAT 
Example 3

Recombinant PKS Genes for 13-desmethoxy FK-506 and FK-520

The present invention provides a variety of recombinant PKS genes in addition to
those described in Examples 1 and 2 for producing 13-desmethoxy FK-506 and FK-520
compounds. This Example provides the construction protocols for recombinant FK-520 and
FK-506 (from Streptomyces sp. MA6858 (ATCC 55098), described in U.S. Patent No.
5,116,756, incorporated herein by reference) PKS genes in which the module 8 AT coding
sequences have been replaced by either the rapAT3 (the AT domain from module 3 of the
rapamycin PKS), rapAT12, eryAT1 (the AT domain from module 1 of the erythromycin
(DEBS) PKS), or eryAT2 coding sequences. Each of these constructs provides a PKS that
produces the 13-desmethoxy-13-methyl derivative, except for the rapAT12 replacement,
which provides the 13-desmethoxy derivative, i.e., it has a hydrogen where the other
derivatives have methyl.

Figure 7 shows the process used to generate the AT replacement constructs. First, a
fragment of ~4.5 kb containing module 8 coding sequences from the FK-520 cluster of
ATCC 14891 was cloned using the convenient restriction sites SacI and SphI (Step A in
Figure 7). The choice of restriction sites used to clone a 4.0 - 4.5 kb fragment comprising
module 8 coding sequences from other FK-520 or FK-506 clusters can be different
depending on the DNA sequence, but the overall scheme is identical. The unique SacI and
SphI restriction sites at the ends of the FK-520 module 8 fragment were then changed to
unique BglII and NsiI sites by ligation to synthetic linkers (described in the preceding
Examples, see Step B of Figure 7). Fragments containing sequences 5' and 3' of the AT8
sequences were then amplified using primers, described above, that introduced either an
AvrII site or an NheI site at two different KS/AT boundaries and an XhoI site at the AT/DH
boundary (Step C of Figure 7). Heterologous AT domains from the rapamycin and
erythromycin gene clusters were amplified using primers, as described above, that
introduced the same sites as just described (Step D of Figure 7). The fragments were ligated
to give hybrid modules with in-frame fusions at the KS/AT and AT/DH boundaries (Step E
of Figure 7). Finally, these hybrid modules were ligated into the BamHI and PstI sites of the
KC515 vector. The resulting recombinant phage were used to transform the FK-506 and
FK-520 producer strains to yield the desired recombinant cells, as described in the
preceding Examples.
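
The site-engineering step described above can be checked computationally before any cloning is attempted. The following is a minimal sketch, not part of the disclosed protocol: it locates AvrII (CCTAGG), NheI (GCTAGC), and XhoI (CTCGAG) recognition sequences in a coding sequence and reports whether each lies on a codon boundary, as an in-frame fusion requires. The function names and the short test sequence are hypothetical and are not taken from the FK-506 or FK-520 clusters.

```python
# Recognition sequences of the three enzymes used to build the hybrid modules.
# All three sites are palindromic, so scanning one strand is sufficient.
RECOGNITION_SITES = {"AvrII": "CCTAGG", "NheI": "GCTAGC", "XhoI": "CTCGAG"}


def find_sites(sequence, sites=RECOGNITION_SITES):
    """Return {enzyme: [0-based start positions]} for each recognition site."""
    seq = sequence.upper()
    hits = {}
    for enzyme, site in sites.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        hits[enzyme] = positions
    return hits


def on_codon_boundary(position, frame_start=0):
    """True if a site starting at `position` begins on a codon boundary."""
    return (position - frame_start) % 3 == 0


# Hypothetical toy insert: ATG GCC | CCTAGG | GGT GGT | CTCGAG | TCC
toy_module = "ATGGCC" + "CCTAGG" + "GGTGGT" + "CTCGAG" + "TCC"
toy_hits = find_sites(toy_module)
```

For the toy insert, `find_sites` reports AvrII at position 6 and XhoI at position 18, both on codon boundaries, and no NheI site. A real construct would be checked the same way against the full module 8 sequence.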

The following table shows the location and sequences surrounding the engineered 
site of each of the heterologous AT domains employed. The FK-506 hybrid construct was 
used as a control for the FK-520 recombinant cells produced, and a similar FK-520 hybrid 
construct was used as a control for the FK-506 recombinant cells. 



Heterologous AT      Enzyme   Location of Engineered Site

FK-506 AT8           AvrII    [sequence illegible in source]
(hydroxymalonyl)              [translation illegible in source]
                     NheI     ACCCAGCATCC7GCGA7GGGTGAGCGqctcqcC
                              t q h f g e r e a
                     XhoI     TACGCCTTCCAGG 3GG GGCCCTACTGGat: caaa
                              Y A F Q r. R F Y W I E

rapamycin AT3        AvrII    G ACCGG c c cc g + i ~ v.; GoG GGG CGTuTCGTCC TTC
(methylmalonyl)               D R ? R R A G V S S r
                     NheI     TGGG AGTGGCT wi'^'joAT G'-joCAGTGCccti cjcqu
                              W Q W L G M G S A L R
                     XhoI     TACGCCTTCCAACACCAGCGGTACTGGqt caaq
                              Y A t y H Q K I W V L

rapamycin AT12       AvrII    GGCCGAqcqcqcCGGGCAGGCGTGTCGTCCTTC
(malonyl)                     G R A R R A G V S S F
                     NheI     TCGCAGCGTGCTGGCATGGGTGAGGAact qqcC
                              S C P- A G M G E E L A
                     XhoI     TACGCCTTCCAGGACCAGCGCTACTGGctccaa
                              Y A F G :i C R V W L 5

DEBS AT1             AvrII    G C G G G A c c a c q c G G G G G G G 3 G G T C T C G T G G T T C
(methylmalonyl)               A RPR P. A G V S S F
                     NheI     TGGCAGTGGGCGGGCATGGCCGTCGAcctactC
                              W Q W A G M A V D L L
                     XhoI     TACCCGTTCCA:GCGCGAGCGCGTCTGGctcaaa
                              Y P F Q R £ R V W L E

DEBS AT2             AvrII    GACGGGat acqcCGGGCA.GGTGTGTCGGCGTTC
(methylmalonyl)               D G V R R A G V S A F
                     NheI     GCCCAGTGGGAA.GGCATGGCGCGGGAqt t qt nG
                              A Q W E G M A R E L L
                     XhoI     7 A T C C T T T C C A. G G G C AA G C G G T T C T G G c t a c t c
                              Y F F C G K R F W L L



The sequences shown below provide the location of the KS/AT boundaries chosen
in the FK-520 module 8 coding sequences. Regions where AvrII and NheI sites were
engineered are indicated by lower case and underlining.
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ACACCA 
IDT 



10 7 ? 7 V D R V A V A Q T L A R P. T H F A 

~. 7 C G C G 7 CG 7 G CT G C T CG 3 T G AC AC C G T C A T C AC Chl. ^ C i ^ C ^ v_ ^ ^ _ o o ^ C C G AC G 
H R A V L L G D T V I T T ? P A u R P D 

15 Y7GCCGC7CATCC7GT37T7GCCGACGCCT7 AC.. J,- . 7 7 7 G 7 7 7 7777 G AC AA C C 

A A A H P V 7 A D A W H fcl A 7 7 r 7 G H 

The sequences shown below provide the location of the AT/DH boundary chosen in
the FK-520 module 8 coding sequences. The region where an XhoI site was engineered is
indicated by lower case and underlining.

A 7 7 A 7 'I ' G G a c c g a a T C G G C A. C G C C C G G C C G 7 A 7 C G C CoTljo^C C A 7 _ _ ^ 7 d C T G G G C T 
K Y w I PSA R ? A A S D A G H P 7 L G 

25 

The sequences shown below provide the location of the KS/AT boundaries chosen
in the FK-506 module 8 coding sequences. Regions where AvrII and NheI sites were
engineered are indicated by lower case and underlining.

7CGGCCAGGGGGTGGCCGCGGACCGGCCGTC2£££CCGTGCGGCGGT7CCGTCGTTCGGG 
30 7 A R F W P R ? G R ? P. R A A V 5 S F G 
G T G A 7; C G G C A C C AA C G C C C A C A T C .A T C C T G G A G G C C G G A C C L G A C C A* G ^ .-vo G A <o C C G T C G 

V S G T N A r. I I L 7 A G ? D Q F F P S 
G 7/vGAA/C7GGCCGG7G;7C777C7GCTGC7CG7GTCGGC.ACGG7CC7 -rv- ^CAlTGGAC 
AFPAGDLP7 7 7SARSPFALD 
35 G AGC AG A7 CG GGCGG CTGCG CG ACT A.TCTCG AC GCCGCCCCCGG7GT 7 G AG G7GGCGGCC 
7 Q Z G R L R D Y L 7 A A ? G V - ~ A A 
GTGGCGCGGACACTGGCCACGCGTA.CGCACTCC7CCCACCGCGCCGTA.ee GCTCGGTG AC 



ARTLATPT 



HPS H P A V 7 L G D 



ACCGTCATCACCGCTCCCCCCGCGGA^CAGCCGGGCGAGCTCGTCTTCGCCCACTCC 
40 TVITAPPVEQPGELVF V Y S G 
CAGGGCACCCAGCATCCCGCGATGGGTGAGCGactcocCGCAGCCTTCCCCGTGTTCGCC 

£ G T Q H P A M G E R L A A A F ? V F A 
GACCCGGACGTACCCGCCTACGCCTTCCAGCGGCGGCCCTACTGGATCGAGTCCGCGCCG 

C ? D V P A Y A r Q R R P V W I E S A P 
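
The pairing of each nucleotide row with the amino acid row beneath it can be verified by translating the nucleotides with the standard genetic code. The sketch below is illustrative only and assumes reading frame 1; the helper name `translate` is hypothetical, and the fragment is the last nucleotide row of the FK-506 listing above, copied as printed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; unreadable codons become 'X'."""
    seq = dna.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, usable, 3)
    )


# Last nucleotide row of the FK-506 module 8 KS/AT listing above.
fragment = "GACCCGGACGTACCCGCCTACGCCTTCCAGCGGCGGCCCTACTGGATCGAGTCCGCGCCG"
peptide = translate(fragment)
```

Here `peptide` evaluates to `DPDVPAYAFQRRPYWIESAP`, which agrees with the legible letters of the translation row printed beneath that fragment. The same helper shows why the engineered sites allow in-frame fusions: read in frame, CCTAGG (AvrII) encodes Pro-Arg, GCTAGC (NheI) encodes Ala-Ser, and CTCGAG (XhoI) encodes Leu-Glu.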



The sequences shown below provide the location of the AT/DH boundary chosen in
the FK-506 module 8 coding sequences. The region where an XhoI site was engineered is
indicated by lower case and underlining.



' C C AG C G 7 7 G G 7 C C T ACT G G 



50 p 7 V P A Y A 7 Q 7 P r Y W IPS A. 
Example 4

Replacement of Methoxyl with Hydrogen or Methyl at C-15 of FK-506 and FK-520
The methods and reagents of the present invention also provide novel FK-506 and
FK-520 derivatives in which the methoxy group at C-15 is replaced by a hydrogen or
methyl. These derivatives are produced in recombinant host cells of the invention that
express recombinant PKS enzymes that produce the derivatives. These recombinant PKS
enzymes are prepared in accordance with the methodology of Examples 1 and 2, with the
exception that the AT domain of module 7, instead of module 8, is replaced. Moreover, the
present invention provides recombinant PKS enzymes in which the AT domains of both
modules 7 and 8 have been changed. The table below summarizes the various compounds
provided by the present invention.





Compound   C-13       C-15       Derivative Provided

FK-506     hydrogen   hydrogen   13,15-didesmethoxy-FK-506
FK-506     hydrogen   methoxy    13-desmethoxy-FK-506
FK-506     hydrogen   methyl     13,15-didesmethoxy-15-methyl-FK-506
FK-506     methoxy    hydrogen   15-desmethoxy-FK-506
FK-506     methoxy    methoxy    Original compound (FK-506)
FK-506     methoxy    methyl     15-desmethoxy-15-methyl-FK-506
FK-506     methyl     hydrogen   13,15-didesmethoxy-13-methyl-FK-506
FK-506     methyl     methoxy    13-desmethoxy-13-methyl-FK-506
FK-506     methyl     methyl     13,15-didesmethoxy-13,15-dimethyl-FK-506
FK-520     hydrogen   hydrogen   13,15-didesmethoxy-FK-520
FK-520     hydrogen   methoxy    13-desmethoxy-FK-520
FK-520     hydrogen   methyl     13,15-didesmethoxy-15-methyl-FK-520
FK-520     methoxy    hydrogen   15-desmethoxy-FK-520
FK-520     methoxy    methoxy    Original compound (FK-520)
FK-520     methoxy    methyl     15-desmethoxy-15-methyl-FK-520
FK-520     methyl     hydrogen   13,15-didesmethoxy-13-methyl-FK-520
FK-520     methyl     methoxy    13-desmethoxy-13-methyl-FK-520
FK-520     methyl     methyl     13,15-didesmethoxy-13,15-dimethyl-FK-520
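
The naming pattern in the table above is regular enough to generate mechanically, which can help when checking that every combination of C-13 and C-15 substituents has been accounted for. The sketch below is illustrative only: the function names are hypothetical, and the hyphenated naming form is an assumption, normalized from the entries in the table.

```python
from itertools import product

# Substituents considered at C-13 and C-15 in the table above.
SUBSTITUENTS = ("hydrogen", "methoxy", "methyl")


def derivative_name(parent, c13, c15):
    """Build the derivative name for a parent compound (e.g. "FK-506")."""
    groups = {13: c13, 15: c15}
    # Positions that no longer carry the native methoxy group.
    changed = [str(pos) for pos, group in groups.items() if group != "methoxy"]
    if not changed:
        return f"Original compound ({parent})"
    # Positions where the methoxy group was replaced by methyl.
    methylated = [str(pos) for pos, group in groups.items() if group == "methyl"]
    parts = [
        ",".join(changed)
        + ("-didesmethoxy" if len(changed) == 2 else "-desmethoxy")
    ]
    if methylated:
        parts.append(
            ",".join(methylated)
            + ("-dimethyl" if len(methylated) == 2 else "-methyl")
        )
    return "-".join(parts + [parent])


def all_derivatives(parent):
    """Map every (C-13, C-15) substituent pair to its derivative name."""
    return {
        (c13, c15): derivative_name(parent, c13, c15)
        for c13, c15 in product(SUBSTITUENTS, repeat=2)
    }
```

Calling `all_derivatives("FK-506")` yields nine entries, one per row of the FK-506 half of the table; for example, the pair (hydrogen, methyl) maps to `13,15-didesmethoxy-15-methyl-FK-506`.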



Example 5 

Replacement of Methoxyl with Ethyl at C-13 and/or C-15 of FK-506 and FK-520
The present invention also provides novel FK-506 and FK-520 derivative
compounds in which the methoxy groups at either or both the C-13 and C-15 positions are
instead ethyl groups. These compounds are produced by novel PKS enzymes of the
invention in which the AT domains of modules 8 and/or 7 are converted to ethylmalonyl
specific AT domains by modification of the PKS gene that encodes the module.
Ethylmalonyl specific AT domain coding sequences can be obtained from, for example, the
FK-520 PKS genes, the niddamycin PKS genes, and the tylosin PKS genes. The novel PKS
genes of the invention include not only those in which either or both of the AT domains of
modules 7 and 8 have been converted to ethylmalonyl specific AT domains but also those in
which one of the modules is converted to an ethylmalonyl specific AT domain and the other
is converted to a malonyl specific or a methylmalonyl specific AT domain.

Example 6 
Neurotrophic Compounds 
The compounds described in Examples 1 - 4, inclusive, have immunosuppressant
activity and can be employed as immunosuppressants in a manner and in formulations 
similar to those employed for FK-506. The compounds of the invention are generally 
effective for the prevention of organ rejection in patients receiving organ transplants and in 
particular can be used for immunosuppression following orthotopic liver transplantation. 
These compounds also have pharmacokinetic properties and metabolism that are more 
advantageous for certain applications relative to those of FK-506 or FK-520. These 
compounds are also neurotrophic; however, for use as neurotrophins, it is desirable to 
modify the compounds to diminish or abolish their immunosuppressant activity. This can be 
readily accomplished by hydroxylating the compounds at the C-18 position using 
established chemical methodology or novel FK-520 PKS genes provided by the present 
invention. 

Thus, in one aspect, the present invention provides a method for stimulating nerve
growth that comprises administering a therapeutically effective dose of 18-hydroxy-FK-520.
In another embodiment, the compound administered is a C-18,20-dihydroxy-FK-520
derivative. In another embodiment, the compound administered is a C-13-desmethoxy
and/or C-15-desmethoxy 18-hydroxy-FK-520 derivative. In another embodiment, the
compound administered is a C-13-desmethoxy and/or C-15-desmethoxy
18,20-dihydroxy-FK-520 derivative. In other embodiments, the compounds are the
corresponding analogs of FK-506. The 18-hydroxy compounds of the invention can be
prepared chemically, as described in U.S. Patent No. 5,189,042, incorporated herein by
reference, or by fermentation of a recombinant host cell provided by the present invention
that expresses a recombinant PKS in which the module 5 DH domain has been deleted or
rendered non-functional.

The chemical methodology is as follows. A compound of the invention (~200 mg) is
dissolved in 3 mL of dry methylene chloride and added to 45 µL of 2,6-lutidine, and the
mixture stirred at room temperature. After 10 minutes, tert-butyldimethylsilyl
trifluoromethanesulfonate (64 µL) is added by syringe. After 15 minutes, the reaction
mixture is diluted with ethyl acetate, washed with saturated bicarbonate, washed with brine,
and the organic phase dried over magnesium sulfate. Removal of solvent in vacuo and flash
chromatography on silica gel (ethyl acetate:hexane (1:2) plus 1% methanol) gives the
protected compound, which is dissolved in 95% ethanol (2.2 mL) and to which is added 53
µL of pyridine, followed by selenium dioxide (58 mg). The flask is fitted with a water
condenser and heated to 70°C on a mantle. After 20 hours, the mixture is cooled to room
temperature, filtered through diatomaceous earth, and the filtrate poured into a saturated
sodium bicarbonate solution. This is extracted with ethyl acetate, and the organic phase is
washed with brine and dried over magnesium sulfate. The solution is concentrated and
purified by flash chromatography on silica gel (ethyl acetate:hexane (1:2) plus 1%
methanol) to give the protected 18-hydroxy compound. This compound is dissolved in
acetonitrile and treated with aqueous HF to remove the protecting groups. After dilution
with ethyl acetate, the mixture is washed with saturated bicarbonate and brine, dried over
magnesium sulfate, filtered, and evaporated to yield the 18-hydroxy compound. Thus, the
present invention provides the C-18-hydroxyl derivatives of the compounds described in
Examples 1 - 4.

Those of skill in the art will recognize that other suitable chemical procedures can be
used to prepare the novel 18-hydroxy compounds of the invention. See, e.g., Kawai et al.,
Jan. 1993, Structure-activity profiles of macrolactam immunosuppressant FK-506
analogues, FEBS Letters 316(2): 107-113, incorporated herein by reference. These methods
can be used to prepare both the C18-[S]-OH and C18-[R]-OH enantiomers, with the R
enantiomer showing a somewhat lower IC50, which may be preferred in some applications.
See Kawai et al., supra. Another preferred protocol is described in Umbreit and Sharpless,
1977, JACS 99(16): 1526-28, although it may be preferable to use 30 equivalents each of
SeO2 and t-BuOOH rather than the 0.02 and 3-4 equivalents, respectively, described in that
reference.

All scientific and patent publications referenced herein are hereby incorporated by 
reference. The invention having now been described by way of written description and 
example, those of skill in the art will recognize that the invention can be practiced in a
variety of embodiments, and that the foregoing description and examples are for purposes of
illustration and not limitation of the following claims.
Claims

1. An isolated nucleic acid that encodes a CoA ligase, a non-ribosomal peptide
synthetase, or a domain of an extender module of a polyketide synthase enzyme that
synthesizes FK-520.

2. The isolated nucleic acid of claim 1 that encodes an extender module, said 
module comprising a ketosynthase domain, an acyl transferase domain, and an acyl carrier 
protein domain. 

3. The isolated nucleic acid of claim 1 that encodes an open reading frame, said 
open reading frame comprising coding sequences for two or more extender modules, each 
extender module comprising a ketosynthase domain, an acyl transferase domain, and an 
acyl carrier protein domain. 

4. The isolated nucleic acid of claim 1 that encodes a gene cluster, said gene cluster 
comprising two or more open reading frames, each of said open reading frames comprising 
coding sequences for two or more extender modules, each of said extender modules 
comprising a ketosynthase domain, an acyl transferase domain, and an acyl carrier protein 
domain. 

5. The isolated nucleic acid of claim 2, wherein at least one of said domains is a
domain of a module of a non-FK-520 polyketide synthase.

6. The isolated nucleic acid of claim 1, wherein said nucleic acid is a recombinant
vector capable of replication in or integration into the chromosome of a host cell. 

7. The isolated nucleic acid of claim 6 that is selected from the group consisting of
cosmid pKOS034-120, cosmid pKOS034-124, cosmid pKOS065-M27, and cosmid
pKOS065-M21.

8. The isolated nucleic acid of claim 5, wherein said non-FK-520 polyketide
synthase is rapamycin polyketide synthase, FK-506 polyketide synthase, or erythromycin
polyketide synthase.

9. A method of preparing a polyketide, said method comprising transforming a
host cell with a recombinant DNA vector of claim 6, and culturing said host cell under
conditions such that said polyketide synthase is produced and catalyzes synthesis of said 
polyketide. 

10. The method of claim 9, wherein said host cell is a Streptomyces host cell. 

11. The method of claim 9, wherein said polyketide is selected from the group
consisting of FK-520, 13-desmethoxy-FK-520, and 13-desmethoxy-FK-506.

12. A recombinant host cell that expresses a recombinant polyketide synthase 
selected from the group consisting of: (i) an FK-520 polyketide synthase in which at least 
one AT domain is replaced by an AT domain of a non-FK-520 polyketide synthase; (ii) an 
FK-506 polyketide synthase in which at least one AT domain is replaced by an AT domain 
of a non-FK-506 polyketide synthase; (iii) an FK-520 polyketide synthase in which at least 
one DH domain has been deleted; (iv) an FK-506 polyketide synthase in which at least one 
DH domain has been deleted. 

13. The recombinant host cell of claim 12 that expresses an FK-520 polyketide 
synthase in which an AT domain of module 8 has been replaced by an AT domain that 
binds malonyl CoA, methylmalonyl CoA, or ethylmalonyl CoA. 

14. The recombinant host cell of claim 12 that expresses an FK-506 polyketide 
synthase in which an AT domain of module 8 has been replaced by an AT domain that 
binds malonyl CoA, methylmalonyl CoA, or ethylmalonyl CoA.

15. The recombinant host cell of claim 13, wherein a DH domain of module 5 or 
module 6 has been deleted. 

16. The recombinant host cell of claim 14, wherein a DH domain of module 5 or 
module 6 has been deleted. 
17. A recombinant host cell that comprises recombinant genes coding for enzymes
sufficient for synthesis of ethylmalonyl CoA or 2-hydroxymalonyl CoA.

18. A polyketide having the structure




wherein R1 is hydrogen, methyl, ethyl, or allyl; R2 is hydrogen or hydroxyl, provided that
when R2 is hydrogen, there is a double bond between C-20 and C-19; R3 is hydrogen or
hydroxyl; R4 is methoxyl, hydrogen, methyl, or ethyl; and R5 is methoxyl, hydrogen,
methyl, or ethyl; but not including FK-506, FK-520, 18-hydroxy-FK-520, and
18-hydroxy-FK-506.

19. The polyketide of claim 18 that is 13-desmethoxy-FK-506.

20. The polyketide of claim 18 that is 13-desmethoxy-18-hydroxy-FK-520.
fkbC (C-terminus)
fkbL
fkbK   hydroxybutyryl-CoA dehydrogenase
fkbJ   ?
fkbI   crotonyl-CoA reductase
fkbH   ?
fkbG   O-methyltransferase
fkbF   dicarboxylate carrier (membrane) protein
fkbE   hydroxybutyryl-CoA dehydratase
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